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Preface 


It is now well known that Fermat’s last theorem has been proved. For 
more than three and a half centuries, mathematicians — from the 
great names to the clever amateurs — tried to prove Fermat’s famous 
statement. The approach was new and involved very sophisticated 
theories. Finally the long-sought proof was achieved. The arithmetic 
theory of elliptic curves, modular forms, Galois representations, and 
their deformations, developed by many mathematicians, were the 
tools required to complete the difficult proof. 

Linked with this great mathematical feat are the names of TANI- 
YAMA, SHIMURA, FREY, SERRE, RIBET, WILES, TAYLOR. 
Their contributions, as well as hints of the proof, are discussed in 
the Epilogue. This book has not been written with the purpose of 
presenting the proof of Fermat’s theorem. On the contrary, it is writ- 
ten for amateurs, teachers, and mathematicians curious about the 
unfolding of the subject. I employ exclusively elementary methods 
(except in the Epilogue). They have only led to partial solutions 
but their interest goes beyond Fermat’s problem. One cannot stop 
admiring the results obtained with these limited techniques. 

Nevertheless, I warn that as far as I can see — which in fact is 
not much — the methods presented here will not lead to a proof of 
Fermat’s last theorem for all exponents. 


vi Preface 


The presentation is self-contained and details are not spared, so 
the reading should be smooth. 

Most of the considerations involve ordinary rational numbers and 
only occasionally some algebraic (non-rational) numbers. For this 
reason I excluded Kummer’s important contributions, which are 
treated in detail in my book, Classical Theory of Algebraic Num- 
bers and described in my 13 Lectures on Fermat’s Last Theorem 
(new printing, containing an Epilogue about recent results). 

There are already — and there will be more — books, mono- 
graphs, and papers explaining the ideas and steps in the proof of 
Fermat’s theorem. The readers with an extended solid background 
will profit more from reading such writings. Others may prefer to 
stay with me. 

In summary, if you are an amateur or a young beginner, you may 
love what you will read here, as I made a serious effort to provide 
thorough and clear explanations. 

On the other hand, if you are a professional mathematician, you 
may then wonder why I have undertaken this task now that the 
problem has been solved. The tower of Babel did not reach the 
sky, but it was one of the marvels of ancient times. Here too, there 
are some admirable examples of ingenuity, even more remarkable 
considering that the arguments are strictly elementary. It would be 
an unforgivable error to let these gems sink into oblivion. As Jacobi 
said, all for “Vhonneur de esprit humain.” 


August, 1997 Paulo Ribenboim 


Reader 


You may feel tempted to write your own (simpler) proof of Fermat’s 
last theorem. 

I have strong views about such a project. It should be written in 
the Constitution of States and Nations, in the Chapter of Human 
Rights: 

It is an inalienable right of each individual to produce his or her 
own proof of Fermat’s last theorem. 

However, such a solemn statement about Fermat’s last theorem 
(henceforth referred to as THE theorem) should be tempered by the 
following articles: 


Art. 1. No attempted proof of THE theorem should 
ever duplicate a previous one. 


Art. 2. It is a criminal offense to submit false proofs 
of THE theorem to professors who arduously earn 
their living by teaching how not to conceive false proofs 
of THE theorem. 


Infringement of the latter, leads directly to Hell. Return to Par- 
adise only after the said criminal has understood and is able to re- 
produce Wiles’ proof. (Harsh punishment.) 
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The Problem 


In the margin of his copy of Bachet’s edition of the works of Dio- 
phantus, | Fermat wrote: 

“Tt is impossible to separate a cube into two cubes, or a biquadrate 
into two biquadrates, or in general any power higher than the second 
into powers of like degree; I have discovered a truly marvelous proof, 
which this margin is too small to contain.” 

In modern language, this means: 


If n is any natural number greater than 2, the equation 


has no solutions in integers, all different from 0 (i.e., it has only the 
trivial solution, where one of the integers is equal to 0). 


The above statement has been called Fermat’s last theorem, or 
conjecture, or problem. 

We begin with the following remarks. 

In order to prove Fermat’s theorem for all exponents greater than 
2, it suffices to prove it for the exponent 4 and every odd prime 


!This copy is now lost, but the remarks appeared in the 1679 edition of 
the works of Fermat, edited in Toulouse by his son Samuel de Fermat. 


2 The Problem 


exponent p. Indeed, if n is composite, n > 2, it has a factor m which 
is 4 or an odd prime p. If the theorem fails for n = ml, where m = 4 
or p, 1 > 1, if x,y,z are non-zero integers such that 7” + y” = z” 
then (2')™ + (y')™ = (z')™ and the theorem would fail for m. 

Occasionally, we shall also indicate some results and proofs for 
even exponents or prime-power exponents. 

The following general remarks are quite obvious and henceforth 
will be taken for granted. 

If n is odd then X" + Y” = Z” has a non-trivial solution if and 
only if X" + Y" + Z” = 0 has a non-trivial solution. 

If x,y,z are non-zero integers such that 2” + y” = z", if d = 
gcd(z,y,z) and 2, = x/d,y, = y/d,z, = z/d then 27 4+ yf? = 27, 
where the non-zero integers 21, yi, 21 are pairwise relatively prime. 
So, if we assume that Fermat’s equation has a non-trivial solution, 
then it has one with pairwise relatively prime integers. 

Moreover, if x,y, 2 are non-zero pairwise relatively prime integers 
such that 2” + y” = 2” then 7+ y,z—2,z—y are also pairwise 
relatively prime. Indeed, if a prime p divides x + y and z — x then 
x =z (mod p), hence x” = 2” = x" + y"” (mod p) therefore y” = 0 
(mod p), so p divides y, and since p divides x + y then p divides z, 
which is contrary to the hypothesis. This shows that gcd(a + y, z — 
x) = 1. Ina similar way, we may show that gcd(a + y,z—y) = 1 
and gcd(z —2,z-—y) =1. 

Following tradition, we say that the first case of Fermat’s theorem 
is true for the odd prime exponent p when: if x,y,z are (non-zero) 
integers, not multiples of p, then 2? + y? 4 z?. 

The second case is true for the odd prime exponent p when: if 
X,Y,z are non-zero pairwise relatively prime integers, and p divides 
xyz then x? + y? # z?. As said above, in this case p divides one and 
only one of the integers 2, y, z. 

More generally, for an arbitrary integer n = 2“m, u > 0, m odd, 
we say that the first case of Fermat’s theorem is true for the exponent 
n when: if x,y,z are (non-zero) integers and gcd(m, xyz) = 1 then 
arty" #2”. 

Similarly, the second case is true for the exponent n when: if x, y, z 
are (non-zero) pairwise relatively prime integers and gcd(m, xyz) 4 1 
then «” + y” #2”. 


I 
Special Cases 


This chapter is devoted to the proof of special cases of Fermat’s 
theorem: exponents 4, 3, 5, and 7. However, we begin by considering 
the exceptional case of exponent 2. 


1.1. The Pythagorean Equation 


We study briefly the Pythagorean equation 
(li) X?4Y?7 = 77. 


A triple (x, y, z) of positive integers such that x2? + y? = z? is called 
a Pythagorean triple, for example, (3, 4,5) since 3? + 4? = 5”. 

If x,y,z are nonzero integers such that x? + y? = z? then |}, 
ly|, |z| also satisfy the same equation. Note that x,y cannot be both 
odd, otherwise z* = 1+1 (mod 4), which is impossible. Moreover, if 
d= gcd(z, y, z) then x/d, y/d, z/d also satisfy the equation. Thus, it 
suffices to determine the primitive solutions (x,y, z) of (1.1), namely 
those such that x > 0, y > 0, z > 0, x is even, and gcd(z, y, z) = 1, 
hence y and z are odd. 

It is stated in Dickson’s (1920) History of the Theory of Numbers, 
Vol. II, pp. 165-166, that Pythagoras and Plato gave methods to 
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find solutions of equation (1.1). In Lemma 1 to Proposition 29 of 
Book X of The Elements, Euclid gave a geometric method to find 
solutions of (1.1). 

Diophantus indicated how to find all solutions, as in the following 
result. 

Leonardo di Pisa (Fibonacci) also gave in 1225 an interesting 
method to find solutions. 


(1A) Jfa,b are integers such that a > b > 0, ged(a,b) = 1, a,b of 
different parity, then the triple (x,y,z), given by 


x = 2ab, 
yoo =, 
z=a +d, 


is a primitive solution of (1.1). This establishes a one-to-one corre- 
spondence between the set of pairs (a,b) satisfying the above condi- 
tions, and the set of primitive solutions of (1.1). 


ProoF. If a,b are integers satisfying the conditions of the statement, 
let x,y, z be defined as indicated. Then 

a? +y? sa 4a2b? + (a? = b?)? _ (a? + b?)? =: 2. 
Clearly x > 0, y > 0, z > 0, x is even, and gcd(z, y, z) = 1 because 
if d divides x, y, and z then d divides 2a” and 2b*, sod=1 or d= 2 
(since gced(a, Bie = 1); but d # 2 because y is odd (a,b do not have 
the same parity). 

Different pairs (a,b) give different triples (x, y, z). 

Conversely, let (x, y, z) be a primitive solution of (1.1), so 7?+y? = 
2”. From gcd(z, y, z) = 1 we have gcd(z, z) = 1. Since x is even then 
z is odd hence gcd(z — 2,z+ 2) = 1. Since y? = (z —2)(z4+2), it 
follows from their decomposition into prime numbers that z—2, z+x 
are squares of integers, say z+2=t?, z—x2 =u’, and t,u must be 
positive odd integers, with t > u > 0. Let a,b be integers such that 
2a=t+u, 2b=t—u, hencet =a+b,u=a—bwitha>b>0. So 


2 = ((a+5)? — (a—)?)/2 = 2ab, 
y? = ut? = (a— b)?(a +b)? = (a? — 0’)? so y=a?—b?, 
z =((a+b)? + (a— b)?)/2=a?4+ 0. 
We note that gcd(a,b) = 1 because gcd(z — 2, z+ x) = 1 and finally 
a+6=t is odd so a,b are not both odd. 
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For example, the smallest primitive solutions for (1.1), ordered 
according to increasing values of z, are the following: 


(4, 3,5), (12,5,13), (8,15,17), (24,7, 25), 
(20,21,29), (12,35,37), (40,9,41), (28, 45,53), 
(60,11,61), (56,33,65), (16,63,65), (48,55, 73). 


In view of (1A), to find the primitive solutions of (1.1) amounts 
to determining which odd positive integers are sums of two squares, 
and in each case, to write all such representations. Fermat proved: 
n > 0 is a sum of two squares of integers if and only if every prime 
factor p of n, such that p = 3 (mod 4), appears to an even power 
in the decomposition of n into prime factors (see the proof below). 
For every integer n which is the sum of two squares of integers, let 
r(n) be the number of ordered pairs (a,b) such that a* + b? = n, a,b 
integers not necessarily positive. For example, r(1) = 4, r(5) = 8. It 
was proved by Jacobi, and independently by Gauss, that 


r(n) = 4(d,(n) — d3(n)), 


where d;(n) (respectively, d3(n)) is the number of divisors of n which 
are congruent to 1 modulo 4 (respectively, congruent to 3 modulo 4) 
(see Hardy and Wright (1938, p. 241)). 

With this information, it is possible to determine explicitly the 
primitive Pythagorean triples (x, y, z). Now we paraphrase Fermat’s 
proof which is of historical importance. We begin with a very easy 
identity: 


(12) (a? + b’)(c? +d’) = (ac + bd)? + (ad — bc)? 
= (ac — bd)? + (ad + bc)?. 


Now we show 


(1B) A prime number p is a sum of two squares if and only if 
p=2 orp=1 (mod 4). 


ProoF. If p £4 2 and p = a? + B’, then a,b cannot both be even 
— otherwise 4 divides p. If a,b are both odd, then p=1+1= 2 
(mod 4), since every odd square is congruent to 1 modulo 4. Thus 
p = 2. If, say, a is odd and 6 is even, then p=1+0=1 (mod 4). 
Conversely, 2 = 17 + 1”, so let p = 1 (mod 4). From the the- 
ory of quadratic residues, —1 is a square modulo p, so there exists 
x,1 <a < p—1, such that x? +1 = 0 (mod p), thus 27 +1 = 
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mp, with 1 < m < p—1. Hence the set {m | 1 < m < p-— 
1, such that mp = x? + y? for some integers x,y} is not empty. Let 
mg be the smallest integer in this set, so 1 < mp < p—1. We show 
that mo = 1, hence p is a sum of two squares. Assume, on the 
contrary, that 1 < mp. We write 


{ x= cmt+ 2X, 
y =dm+%, 


with —mo/2 < 21, y1 < mo/2, and integers c,d. We observe that x, 
or y; is not 0. Otherwise mé divides x? +y? = mop, hence mo divides 
p, thus mp = p, which is absurd. We have 0 < 27+ y? < mé/44+ 
m2/4 = m2 /2 < m2 and 2? + y? = 2? +y? = 0 (mod my). Hence, 
xi + y? = mom’ with 1 < m’ < mo. But mop = x? + y?, mom! = 
ti +yj, hence mgm'p = (2° +y’)(2j+y7) = (wxityy1)?+(2yi—yr1)?. 
We also have 


rx, + yyr = «(x — emo) + y(y — dm) 
= (x? + y’) — mo(c + yd) 
= Mol, 

ry, — yt, = «(y —dmo) — y(a — emo) 
= —mo(xd — yc) 


= Mou 


for some integers t, u. Hence m'p = t? +.u?, with 1 < m! < mo. This 
is a contradiction and concludes the proof. 


(1C)_ A natural number n is the sum of two squares of integers if 
and only if every prime factor p of n, such that p = 3 (mod 4), ap- 
pears to an even power in the decomposition of n into prime factors. 
Proor. Let n = ph'-- py and assume that k; is even if p; = 3 
(mod 4). Then n = nin, where no > 1, nm, > 1, and n, is the 
product of distinct primes which are either equal to 2, or congruent 
to 1 modulo 4. By (1B), each factor of n; is a sum of two squares; 
by the identity indicated in (1.2), n; and therefore also n, is a sum 
of two squares. Conversely, let n = x? + y?; the statement is trivial 
if z = Oory = 0. Let x,y be nonzero, let d = gced(z,y), so d? 
divides n. Let n = d?n’, x = dz’, y = dy’, hence gcd(z’, y’) = 1 and 
n =x +y’. If p divides n’, then p does not divide x’ — otherwise 
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p would also divide y’. Let & be such that kx’ = y’ (mod p). Then 
a”? + y? = #?(1+ k?) = 0 (mod p). Thus p divides 1 + k?, that 
is, —1 is a square modulo p, so p = 2 or p = 1 (mod 4), by the 
theory of quadratic residues. It follows that if p; = 3 (mod 4) then 
p; does not divide n’, hence p; divides d, so the exponent k; must 
be even. 


It is customary to say that a right triangle is a Pythagorean tri- 
angle when its sides are measured by integers a,b,c. If c¢ is the 
hypothenuse, then c? = a? + b?. See also Mariani (1962). 

On this matter, we recommend Shanks’ book (1962) which con- 
tains an interesting chapter on Pythagoreanism and its applications, 
as well as the book by Sierpiriski (1962). 

In 1908, Bottari gave another parametrization for the solutions of 
(1.1). The following simpler proof is due to Cattaneo (1908): 


(1D) _—[f.a,b are odd natural numbers such that gcd(a,b) = 1, if 
s > 1 then the triple (x,y,z) given by 

x = 2°51? + 2*ab, 

y = b? + 2*ab, 

z= 278-19? + b? + 2% ab, 
is a primitive solution of (1.1). This establishes a one-to-one cor- 


respondence between the set of triples (a,b, s) satisfying the above 
conditions and the set of primitive solutions of (1.1). 


PROOF. It is clear that if x,y,z are defined as indicated then the 


triple (x,y, z) is a primitive solution of (1.1). 
Different triples (a,b, s) give rise to different primitive solutions 


(x,y,z), because 
b* = z-2, 
g2s—1g2 =2-Y, 


2°ab = ax+y-z%. 


Finally, if (z,y,z) is a primitive solution, 0 < 4 < z,0 <y< 
z, andz<a+y, because z? = 27+ y? < (4+ y)?. We write 


L=Z-4U, 
Yrs, 
zZ=xX+Y-U, 
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with u,v,w > 0. Then 


L=v+wv, 
yr=uru, 
Z=uUutrvt+w. 


From x? + y? = 2? it follows that w? = 2uv, hence w is even. Since 
gcd(u,v,w) = 1 and 2 is even and y is odd then v is even and u is 
odd. Let w = 2°w’, v = 2*v’, where vu’, w’ are odd, s > 1,t > 1. 
Then 27*w’? = 2u-2'v’ sot = 2s—1 and w” = wv’ with ged(u, v’) = 1. 
Hence necessarily u,v’ are squares: u = b”, v' = a”, and therefore 
x = 275-19? + 2%ab, y = 67 + 2°ab, z = 27° 1a? + 67 + 2°ab. 


It is also interesting to describe the solutions of 


(1.3) Ke ye S|: 


The solutions in integers are just (+1,0), (0,41). 

We shall consider the solutions in rational numbers as well as, for 
each prime p, the solutions in the field with p elements. 

Let Q denote the field of rational numbers. For each prime p, let 
F,, be the set {0,1,... ,p —1} of residue classes of Z modulo p. So, 
if a,b € Z, we have @ = b if and only if a, b have the same remainder 
when divided by p. The operations of addition and multiplication 
in F, are defined as follows: 7+ Y= x+y, TY = Ty. With these 
operations, which satisfy the usual properties, F,, becomes a field: if 
a € F, and a # 0, we have gcd(a, p) = 1, so there exist r,s € Z such 
that ar + ps = 1; then af = 1. So F is the inverse of @ in F,. For 
simplicity, we may use the notation x instead of % for the elements 
of F,. We shall indicate a result that is valid for Q as well as for 
each field F, for p > 2. Thus, let F = Q or F, (for p > 2). (More 
generally, F' may be taken to be any field of characteristic different 
from 2, that is, 1+ 14 0 in the field F.) 

Let co be asymbol, oo ¢ F’, and let T = {co} U{t € F' | 1+t? 4 O}. 
Let S= Sp={(2,y)€ Fx F|a2?+y? =1}. So the elements of $ 
are the solutions of (1.3) in the field F’. 

Let py: T — F x F be the following mapping: 


| (co) = (0,—1), 


2 1-# 


(1.4) ift € F then y(t) = ( ) 


ie ai ree eee 
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We note that since 1+?#? 4 0, then 1+? is invertible, so the mapping 
y is well defined. 


(1E) With the above notations, py is a one-to-one mapping from 


fi onto S. 
( * ) ( 25) 
1 2 1 2 z 


then y(t) € S for each t € F such that 1+t? £0. Also, (0,—1) € S, 
so y(T) CS. 

Ift€ F,1+¢? £0 then (1—?#?)/(14+ #7) 4 —-1, because 14+ 140. 
Also, if t,t2 € F,14+t7 40,1+t3 £0 then v(t) ¥ v(t), as is 
easily seen. Thus the mapping ¢ is one-to-one (because 1+ 1 # 0). 

Now we show that y(T) = S. Clearly, (0,—1) = y(co). Let 
(x,y) € S, (x,y) 4 (0,-1). If x = 0 then y = 1 and (0,1) = (0). If 
x #0 let t= (1—y)/z, so 


PROOF. Since 


2(1 — 2 —l 
pepe = L-Pe uly iu 
xv x 
and 
an 1? _ 
fenyee ot (ape 


hence (x,y) = y(t), concluding the proof. 


If F = Qthen 1+ 2? £0 for allt € Q, so T = QU {oo}. If F=F, 
with p > 2 then 1+ ?#? = 0 if and only if —1 is a square modulo 
p. According to the result of Fermat already quoted, —1 is a square 
modulo p > 2 if and only if p=1 (mod 4). 

Let N, denote the number of elements of Sz,. We have 


(1F) N,=2 and if p> 2 then 


N.= p—-1 when p=1 (mod 4), 
P™ | p+1 when p=-1 (mod 4). 


PROOF. Sp, = {(0,1),(1,0)}, so No = 2. Let p > 2. Ifp=1 
(mod 4), then there exist two elements t,, t2 such that t7+1 = t3+1 = 
0. So #(T) = (p— 2) +1=p-—1, hence by (1E), #(S) =p-1. 
Similarly, if p = —1 (mod 4), then #(T) = p+1 and #(S) = 
pti. 
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1621 


1676 


1863 


1908 


1908 


1915 


1920 


1938 


1962 


1962 


1962 


1972 
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1.2. The Biquadratic Equation 


Now we take up the case n = 4. Fermat considered the problem 
of whether the area of a Pythagorean triangle may be the square of 
an integer (observation to Question 20 of Diophantus, Book VI of 
Arithmetica). 

He was led to study the equation 


(2.1) X= Yt=7f* 
and he showed (date unknown): 


(2A) Equation (2.1) has no solutions in integers all different 
from 0. 


ProoF. If the statement is false, let (x,y,z) be a triple of positive 
integers with smallest possible x, such that x+ — y* = z?. Then 
gcd(x,y) = 1, because if a prime p divides both x, y then p* divides 
22, so p® divides z; letting x = px’, y = py’, z = p?z' then x —y"* = 
2", with 0 < 2’ < x, which is contrary to the hypothesis. 

We have z? = x*—y* = (x? +y’)(x? — y”) and gced(x? + y?, x? — y”) 
is equal to 1 or 2, as is easily seen, because gcd(z,y) = 1. We 
distinguish two cases. 


Case 1: gcd(a? + y?, x2? — y*) = 1. 

Since the product of 7?+y?, x?—y? is a square then 27+ y?, 7?—y? 
are squares; more precisely, there exist positive integers s, t, gcd(s, t) 
= 1 such that 


a? + y? = 82, 
ee ei 


It follows that s,t must be odd (since 2%? = s? + t? then s,t have 
the same parity and they cannot both be even). 
So there exist positive integers u,v such that 


rea 
v=(s—1)/2, 


and necessarily gcd(u, v) = 1, because s,t are odd. 
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We have uv = (s?—t?)/4 = y?/2 hence y? = 2uv. Since ged(u, v) = 
1 then there exist positive integers /,m such that 


w= 2, a ee 

v= m?, v = 2m’. 
We just consider the first alternative, the other one being analogous. 
So u is even, gcd(u,v,x2) = 1, and 


t)? — t)? 2 t? 
Pay @tOt (ss _ te, 


4 2 


It follows from (1A) that there exist positive integers a,b, 0 <b<a, 
gcd(a, b) = 1, such that 


27 c= 2ab; 
i? =Vweae =F, 


x =a’?+b?, 
hence 1? = ab. Thus there exist positive integers c,d, gcd(c, d) = 1, 


such that 
Cs ve, 
brad, 


and so m? = c* — d*. We note that 0 < c < a < x and the triple of 
positive integers (c,d,m) would be a solution of the equation, which 
is contrary to the choice of x as smallest possible. 
Case 2: gcd(a? + y?, 2% — y?) =2. 

Now z,y are odd and z is even. By (1A) there exist positive 
integers a,b, 0 < b < a, gcd(a,b) = 1, such that 


a = a2 +b, 
y? a? — b?, 


z = 2ab. 


Hence x7y? = a* — b* with 0 < a < @ and this is contrary to the 
choice of x as smallest possible. 


The above argument is called the method of infinite descent and 
was invented by Fermat. It may also be phrased as follows: if 
(x0, Yo, Zo) were a solution in positive integers of (2.1) then we would 
obtain a new solution in positive integers (x1, y1, 21) with z1 < Zz. 
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Repeating this procedure, we would produce an infinite decreasing 
sequence of positive integers 


Zo > By > 2g >see 


which is not possible. 

As a corollary, we obtain the original statement of Fermat, pro- 
posed as a problem or mentioned in letters to Mersenne [for Sainte- 
Croix] (September 1636), to Mersenne (May ?, 1640), to Saint- 
Martin (May 31, 1643), to Mersenne (August 1643), to Pascal (25 
September 1654), to Digby [for Wallis] (April 7, 1658), to Carcavi 
(August 1659): 


(2B) The area of a Pythagorean triangle is not the square of an 
integer. 


PRooF. Let a,b,c be the sides of the Pythagorean triangle, where c 
is the hypotenuse. So c? = a? + b?. 
Assume that the area is the square of an integer s: ab/2 = s?. 
Then 
(a+b)? = c? +4s?, 
{ (a — b)? = c? —4s?. 
Hence (a? — b?)? = c* — (2s)*, so the equation X* — Y* = Z? would 
have nontrivial solution in integers, contradicting (2A). 


We also state explicitly (this is proposed as a problem or men- 
tioned in letters to Mersenne [for Sainte-Croix] (September, 1636), 
to Mersenne (1638), to Mersenne (May ?, 1640)): 


(2C) The equation 
(2.2) X*4+y*=7' 
has no solution in integers, all different from 0. 


Proor. If x,y,z are nonzero integers such that 2+ + y* = z* then 
24 — y' = (x), which contradicts (2A). 


The above results were also reproduced by Euler (1770) and Leg- 
endre (1808, 1830). 

A companion result to (2A) is the following (see the explicit proof 
by Euler, 1770): 
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(2D) ~The equation 
(2.3) XPLY* = 7 
has no solution in integers all different from 0. 


ProoF. If the statement is false, let (x,y, z) be a triple of positive 
integers, with smallest possible z, such that 24+y* = 27. As in (2A), 
we may assume gcd(z, y) = 1. We also note that x, y cannot be both 
odd, otherwise z? = x* + y? = 2 (mod 4) and this is impossible. So 
we may, for example, assume x to be even. 

From (a?)? + (y?)? = z? it follows that (2?,y?,z) is a primitive 
solution of (1.1). By (1A), there exist integers a,b, such that a > 
b > 0, gcd(a, b) = 1, a,b are not both odd and 


f= Dab: 
y2 = a2—0?, 
| z =a’?+b?. 
Moreover, 6 must be even. For if b is odd, then a is even, y? = 
a? — b? = —1 (mod 4), which is impossible. 
Now we consider the relation b?+y? = a’, where y, b, a are positive 


integers, b is even, and gcd(b, y,a) = 1. By (1A), there exist integers 
c,d such that c > d > 0, ged(c, d) = 1, c,d of different parity and 


b = 2Qcd, 
7] Sead, 
a=c+d. 


Therefore x? = 2ab = 4cd(c? + d*). But c,d, c? + d are pair- 
wise relatively prime. By the decomposition of x? into primes, we 
conclude that c,d, c? + d? are squares of positive integers, say: 


Hence 
(2.4) pt+ge=r, 


that is, the triple (p,q,r) is a solution of (2.3). But z= a?+b? = 
(c? +d’)? +4c?d? > r4 > r (since r > 1). This contradicts the choice 
of z as minimal possible, and concludes the proof. 
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TABLE 1. FLT for the exponent 4. 


Author Year 

Frénicle De Bessy 1676 

Euler 1738 (publ. 1747), 1771 
Kausler 1795/6 (publ. 1802) 
Barlow 1811 

Legendre 1823, 1830 
Schopis 1825 

Terquem 1846 

Bertrand 1851 

Lebesgue 1853, 1859, 1862 
Pepin 1883 
Tafelmacher 1893 

Bendz 1901 

Gambioli 1901 

Kronecker 1901 

Bang 1905 

Bottari 1908 

Rychlik 1910 

Nutzhorn 1912 
Carmichael 1913 

Vranceanu 1966 


Other proofs of Fermat’s theorem for the exponent 4 are given by 
the authors listed in Table 1. Now we indicate a statement which is 
equivalent to Fermat’s theorem for the exponent 4 (see Vranceanu, 
1979): 


(2E) = The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent 4. 
(2) For every integer m # 0 the only solutions in nonzero inte- 
gers of 2X* = mY(m?4+ Y?) are (m,m) and (—m,m). 


Proor. (1) — (2) Let m #0 and assume that there exist nonzero 
integers u,t such that 2u* = mt(m?+??). Let x = 2u, y=t—m, z= 
t+m. Then 24 —y* = (z-y)(z+y)(2? +?) = 2m- 2t(2t? + 2m?) = 
8mt(t? + m2) = 16u* = zt. 

By hypothesis, xyz = 0. If « = 0 then y = +z hence m = 0, 
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contrary to the hypothesis. If y = 0 then t = m, x = +z,sou= +m. 
If z= 0 then x = 0, again contrary to the hypothesis. 

(2) + (1) Ifat+y* = 24 then 224 = 2(z4 — y*) = 2(z-—y)(e+ 
y)(w? +y?) = (z—y)(z+y) [(2— y)? + (2 + y)?]. So taking m = z—y 
then t= z+y, u=72 satisfy the relation 2u* = mt(m?+¢??). If m or 
u or t is equal to 0 then « = 0. If m,t,u 4 0, by hypothesis t = m 
hence y = 0, an absurdity. 


We conclude this section by illustrating how the method may be 
applied to find solutions of some similar diophantine equations. 


(2F) The equation 
X*—4Y*= 427? 


has no solution in nonzero integers. 


ProoF. It suffices to consider the equation X*—4Y* = Z?. Because 
if x,y,z are nonzero integers such that x* — 4y* = —z? then 42+ — 
(2y)* = —(2z)?, so (2y)* — 4x4 = (2z)? and (2y,x,2z) would be a 
solution of the first equation. 

Now, if x,y,z are positive integers such that x* — 4y* = z? and 
gcd(x, y, z) = 1 (as we may assume without loss of generality), there 
exist integers a,b with a > 6 > 0 and 


2y? = 2ab, 
22 SP 
Babe, 


Since gcd(a, b) = 1 then a,b are squares, say a = c*, b = d?. Hence 
x? = c' +d", and this relation is impossible by (2D). 


Legendre proved: 


(2G) Ifx,y,z are nonzero integers and 
at + yt = 227, 

then x? = y” and z? = z*. 

PROOF. We have 


424 = (x* +y') = (x* = y')? + 4aty’*, 
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SO 


4 A\ 2 
4,44 (2% —Y 
fa aty (—) 


(in particular, 2’ — y* is even). Since x,y, z are not zero, by (2A) we 
have x4 = y*, so 2? = y*. Then z? = x%. 


(2H) If x,y,z are nonzero integers and 2x* + 2y* = 27, then 
CH= 2 a=. 


PRooF. Multiplying by 8 we have (2x)* + (2y)* = 2(2z)?. By (2G), 
a)" = (26) and 27)" = On) 0 Syne = 4a 


The next result is due to Lucas (1877). We present here an easier 
proof due to Oblath (1952): 


(21) The equation 
4x*—1=3Y* 


has only the trivial solutions (£1,+1) in integers. 


ProoF. If x,y are integers such that 3y4 = 4a4—1 = (2x?41)(2x?— 
1), since 22 — 1 #0 (mod 3) then there exist integers a,b such that 
227 +1 = 3at, 2x? —1 = b*. By (2G), the last equation is only 
satisfied when x = +1, b= +1, hence y = +1. 
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1.3. Gaussian Numbers 


We shall prove that X* + Y* = Z? has no solution in nonzero inte- 
gers of the Gaussian field. This result is explicitly proved in Hilbert’s 
Zahlbericht (1897, Theorem 169); see also Sommer (1907) and Han- 
cock (1931). 

The set of complex numbers a = a+ bi, where i = /—1 and 
a,b € Q, constitutes the Gaussian field kK = Q(t). The numbers 
a =a-+i, where a,b € Z, are called the Gaussian integers. They 
constitute a ring, denoted by A = Z[i]. 

If a, 6 € K, @ divides a if there exists a Gaussian integer y such 
that a = By. We write @ | a when G divides a. Two nonzero 
Gaussian integers a, 2 are associated when a divides 3 and ( divides 
a; we write a ~ 3. The Gaussian integers associated with 1 are 
called the (Gaussian) units. It is easily seen that they are +1, +i. 

A nonzero Gaussian integer a is a prime if it is not a unit and 
the only Gaussian integers dividing a are units or associated with 
a. In the field of Gaussian numbers, every nonzero Gaussian integer 
a is the product of prime Gaussian integers: @ = 71yo°-:7s- This 
decomposition is unique in the following sense: if we also have a = 
6,62---6,, where each 6; is a prime Gaussian integer, then s = t, and 
changing the order if necessary, y; and 6; are associated (for every 
(al PCr 9 

Therefore we may define, in an obvious way, the greatest common 
divisor of nonzero Gaussian integers, which is unique up to units. 

If a,G,7 are Gaussian numbers and 7 # 0, we write a = § 
(mod y) when y divides a — 3. The congruence relation = satisfies 
the same properties as the congruence for ordinary integers. The 
Gaussian integer \ = 1 —7 is a prime and 2 = 7”, so \? | 2 but 
d? 2. We have 1+7= i(1 —72) = id. 

There are precisely four distinct congruence classes modulo 2, 
namely the classes of 0, 1, 7, and A. Indeed, these numbers are pair- 
wise incongruent modulo 2. On the other hand, according to the 
parity of a,b, we deduce that a+bi is congruent to 0, 1, 7, or A, mod- 
ulo 2. In particular, if A/a = a+ bi then a = 1 (mod 2) ora =i 
(mod 2). Then, a? = +1 (mod 4) and a* = 1 (mod 8), that is, 
a’ =1 (mod )°) since 8 = —iX°. 

Now we show: 
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(3A) The equation 
X44Y'=7 


has no solution in Gaussian integers all different from zero. 


Proor. Let €,7,@ € Z/i] be nonzero and such that €4+1* = 6?. We 
may assume without loss of generality that gcd(,7) = 1. Indeed, if 
6 = gcd(€,7), then € = 6€', 1 = 6m’, with €',7' € Zi], gcd(€', 7’) = 
1; so 6* divides 67, hence 6? divides 6, we may write 6 = 676’, with 
6' € Ziti]. Hence é* + /* = 0” where gcd(€’, 7’) = 1. 

From gcd(€,7) = 1 it follows that €,7,6 are pairwise relatively 
prime. We consider two cases. 


Case 1: 2 does not divide 7. 

By a preceding remark €* = 1 (mod A°), n* = 1 (mod A°) so 
6? = £4474 = 2 (mod A°). Since 2 = id? then ? | 67, hence 
d | 0. However, \? / 6, because \* / 2. We write 6 = A0,, where 
\ 6,. Thus 707 = 2 = id? (mod A°) hence 6? = i (mod 4‘), and 
therefore 0/ = —1 (mod A°) since \* ~ 4, \®° ~ 8. However, A / 0, 
hence 6{ = 1 (mod A°), which would imply by subtraction that 2 = 0 
(mod \°), and this is absurd. 


Case 2: divides €. 

Hence \ 6. We write € = AE’, with m > 1, €’ € Zi], and A /é’. 
The essential part of the proof consists in showing the following 
assertion: 

Let n > 1 and let € be a unit of Zi] (so ¢ = 1 or +2). If there 
exist a, 3,y € Zi], pairwise relatively prime, not multiples of A, and 
eM" a* + 84 = 7? then: 

(a) n > 2; and 
(b) there exists a unit €; and a1, 01,71 € Zi], pairwise relatively 
prime, not multiples of A, such that 


ey MOVat + 4 = 42. 


The hypothesis is satisfied with n= m,ce=1,a=€',6=7n, 7 
6. By repeated application of the above assertion, we would find a 
unit <’ and a’, 3’, € Zi], pairwise relatively prime, not multiples 
of A, such that 
e’ Mal* 4 gl = nf? 


This contradicts (a) above. 
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First we show that n > 2. Indeed €A*"a* + G4 —1= 77 — 1 and 
since A / 3, then G4 = 1 (mod A°), so 7? = 1 (mod A*). But A /y7 
hence y =7 (mod 4?) or y =1 (mod A?). In the first case, y? = —1 
(mod A*) hence \* would divide 2, a contradiction. So y — 1 = A? 
where yp € Ziti] and hence y+ 1 = A? +2 = A*(w+7). But either 
A | wor if A) w then yz = i (mod X) because 1 = 7 (mod A); so 
js = —i (mod X). We have shown that in any case A | (fs +7) so X° 
divides y? — 1 = A*yu(2 +7), hence A° divides ¢A*"a* + (G4 — 1); but 
A® divides 34 — 1, A a, hence A° | A*" so n > 2. 

Now we prove (b). We have e\*"a4 = 7? — 34 = (y — B?)(74+ 8). 
We note that gcd(y — 3°, y + 6?) = »”. Indeed X must divide one 
of the factors in the right-hand side, hence it divides both factors, 
because (A + 3?) — (A — 6?) = 26? is a multiple of 7. Since + 
divides the right-hand side, this implies necessarily that \? divides 


both factors: 
YF B= rv, 
yt 3? = ayn ep! 
where v,v’ € Z[i] and ged(v,v’) = 1. Hence cA*"a*t = \4" vv" so by 
the uniqueness of factorization up to units, v,v’ must be fourth pow- 
ers, up to units v = we4, »’ = w'k!*, where Kk,’ € Zi], gcd(K, ’) = 
1, w,w’ are units. Thus 
7 — B= wr*nt, 
y+ B - w! A4n—2 4/4 


and subtracting, 
23? = wy! 4-2/4 _ wr «4. 
Hence 
ia = iw! \4"-4 4/4 as iwk*, 
so 
Te ame rs aK" ~ B’, 
with units w, = —iw’, w; = iw. We show that w; = 1, which suffices 
to establish statement (b). Since n > 2 then A* | 6? — w,K*; but 
A} 8B, hence A) « hence «+ = 1 (mod A°) so A* | 6B? — w,. But r° 
divides 6+ — 1 = (6? — 1)(6? +1) hence 6? = 1 or — 1 (mod 4%). 
This shows that w,; = +1. If w, = —1, by multiplication with —1, 
we obtain the relation 


ar M(r-D 4 + =a (i8)?. 
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So, in all cases, we have shown (b), proving the statement. 
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1.4. The Cubic Equation 


Fermat proposed the problem to show that a cube cannot be equal 
to the sum of two nonzero cubes. See letters to Mersenne [for Sainte- 
Croix] (September, 1636), to Mersenne (May ?, 1640), to Digby [for 
Wallis} (April 7, 1658), to Carcavi (August, 1659), all mentioned 
in the Bibliography of Section I.2; see also a letter to Digby [for 
Brouncker] (August 15, 1657). 

Euler discovered a proof of this statement. It used the method 
of infinite descent and appeared in his book on Algebra, published 
in St. Petersburg in 1770, translated into German in 1802, and 
into English in 1822. A critical study of Euler’s proof uncovered 
an important missing step, concerning the divisibility properties of 
integers of the form a? + 3b?. We note that in his paper of 1760, 
Euler had already proved rigorously that if an odd prime number 
p divides a? + 3b? (where a,b are nonzero relatively prime integers) 
then there exist integers u,v such that p= u? + 3v?. Yet, Euler did 
not establish in full the Lemma 4.7 which is required in the proof. 
Legendre reproduced Euler’s proof in his book (1808, 1830) without 
completing the details. 

In 1875, Pepin published a long paper on numbers of the form a+ 
b\/—c pointing out arguments which had been insufficiently justified 
by Euler concerning numbers of the form a? + cb’, especially for c = 
1,2,3,4,7. Schumacher (1894) noted explicitly the missing link in 
the proof. In 1901 Landau offered a rigorous proof; this was again the 
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object of Holden’s paper (1906) and, once more in 1915, a detailed 
proof appeared in Carmichael’s book.! In 1966, Bergmann published 
a paper with historical considerations and a thorough analysis of 
Euler’s proof. Once more, in 1972, R. Legendre pointed out that 
Euler’s proof was not perfect. In his book, Edwards (1977) discusses 
also this proof. 


(4A) The equation 
(4.1) X°+Y2?+Z77=0 
has only the trivial solutions in integers. 


PRooF. Assume that x, y, and z are nonzero, pairwise relatively 
prime integers such that 2? + y? + z* = 0. Then they must be 
distinct (because 2 is not a cube) and exactly one of these integers 
is even, say x,y are odd and z is even. Among all the solutions with 
above properties, we choose one for which |z] is the smallest possible. 
We shall produce nonzero pairwise relatively prime integers J, m,n 
which are such that I? + m? +n? = 0, n is even, and |z| > |n|. This 
will be a contradiction. Since x + y, x — y are even, there exist 
integers a,b such that 2a = x+y, 2b=2-—y;sor=a+b,y=a—b 
and therefore a,b 4 0, gcd(a,b) = 1 and a,b have different parity. 
Then —2? = a2 +y? = (a+b)? + (a — b)? = 2a(a? + 307). But 
a? + 3b? is odd and z is even, hence 8 divides z*, so 8 divides 2a, 
so b is odd. We have gcd(2a, a” + 3b”) equal to 1 or 3. In fact, if 
p*(k > 1) is a prime power dividing 2a and a? + 3b? then p 4 2 so 
p* divides a, hence 3b?; but p does not divide b, sok = 1 and p=3. 
Now we consider two cases. 


Case 1: gced(2a, a? + 3b?) = 1. 
Then 3 does not divide a. From —z? = 2a(a? +307) it follows from 
the unique factorization of integers into primes that 2a and a? + 3b? 


are cubes: 
264": 
a? + 3b? = 83, 


where s is odd and not a multiple of 3. At this point we make use 
of a fact to be justified later: if s is odd and s* = a? + 3b? with 


1A proposed simplification of Euler’s proof by Pizé (1955) is wrong, as 
pointed out by Yf (1956). 
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gcd(a,b) = 1, then s also must be of the form s = u? + 3v?, with 
u,v € Z, and 
a = u(u? — 9v?), 

{ b = 3u(u? — v?). 
Then v is odd, u is even (because b is odd), u 4 0, 3 does not 
divide u (since 3 does not divide a) and gcd(u,v) = 1. Therefore, 
2u, u+ 3v, u — 3v are pairwise relatively prime and from r? = 2a = 
2u(u — 3v)(u+ 3v) it follows that 2u, u — 3v, u+ 3v are cubes: 


2u = —n3, 
u—3v =8, 
ut+3v =m, 


with /,m,n distinct from 0 (since 3 does not divide u) and pairwise 
relatively prime. We conclude that 


P4im'>+n3 =0, 
where n is even. Now we show that |z| > |n|. In fact, 
|z|> = |2a(a? + 3b?)| = |n?(u? — 9v?)(a? + 3b7)| > 3|n?| > |n3| 


because u? — 9v* = I?m? # 0 and b F 0, since it is odd. This 
contradicts the minimality of |z|. 
Case 2: gcd(2a, a? + 3b?) = 3. 

We write a = 3c. Thus, c is even and indeed 4 divides c, while 3 
does not divide b (since a, b are relatively prime). So —z* = 6c(9c? + 
3b?) = 18c(3c? + b*) where gcd(18c, 3c? + 6?) = 1. Indeed, c is even 
and 6 is odd, therefore 3c? + b? is odd, 3 does not divide 3c? + b? and 
gcd(b,c) = 1. By the unique factorization of integers into primes, 
18c and 3c? + 6? are cubes: 

18c = r°, 
3c? + b? = 83, 
where s is odd and 3 divides r. By the same result already quoted, 
s =u? + 3v? with u,v € Z and 


{ b= u(u? — 9v?), 


c = 30(u? — v?). 


Thus uw is odd, v is even (since b is odd), v 4 0, ged(u, v) = 1. Also, 
2v,u+v,u-—v are pairwise relatively prime. From r? = 18c = 
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54u(u + v)(u — v) we deduce that (r/3)? = 2v(u+ v)(u — v) and 
2uv, ut+tv, u—v are cubes: 


2v = —n, 
utv=Ph, 
u—-v=—m>. 


Thus 1/? + m3? + n° = 0 with 1,m,n different from 0, and n even. 
Now we show that |z| > |n|. In fact, 


|z|> = 18|c|(3c? + 0’) 
= 5A4lv(u? — v?)|(3c? + b?) 
= 27|n|*|u? — v?|(3c? + 6?) 
> |nl. 
Since u? — v? = —I°?m? # 0, |3c? + b?| > 1. Again, this contradicts 


the choice of |z| as minimal. 


We shall now justify the step concerning the expression of s as 
s =u’ + 3v”. For this purpose, we use arguments, already known to 
Fermat, in connection with the study of integers of the form u?+v?. 

Let S' be the set of integers of the form a? + 3b? (a,b € Z). S is 
closed under multiplication, because 


(4.2) (a? + 3b?) (c? + 3d?) = (ac + 3bd)? + 3(ad $ bc)? 
(the equality holds with corresponding signs). 
LEMMA 4.1. Let p be a prime different from 2 and 3. Then the 
following conditions are equivalent: 
(1) p=1 (mod 6). 
(2) —3 is a square modulo p. 


3) The polynomial X? + X +1 has a root in F,. 
Pp 


PROOF. For the equivalence of (1) and (2) we compute the Legendre 
symbol, using Gauss’ reciprocity law: 


=) Q)-crrco-)-() 


So (—3/p) = +1 if and only if p= 1 (mod 3), that is, p= 1 (mod 6). 
For the equivalence of (2) and (3), we write 


2 
X?4X41=(X+4) 43. 
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If there exists a € F, such that a? +a+1=0 then —3 = 4(a— Lye 
and conversely, if —3 = 6? with G € F,, we take a = —5 + 3/2 so 
a+a+1=0. 


1 
2 


LEMMA 4.2. If k is a nonzero integer, if p is a prime, and p = 
7+3d2 € S, pk =a?+3b? € S then p divides ac + 3bd and ad F bc 
(with corresponding signs ) and 


ae 2 
i (<= =) 43 (“=*) a 
Pp Pp 


PROOF. We have 


_ (a? +38?)(c? + 3d?) _ (¥ oe a 7 ( ad + bc ) 
= eaeagne  ~ \aha Bae C438 


by (4.2). But (ac + 3bd)(ac — 3bd) = a?c? — 907d? = a?(c? + 3d”) — 
3(a? + 3b*)d? = (a? — 3kd?)(c? + 3d”). Since c*? + 3d? = p is a prime 
then, say, p divides ac + 3bd, that is, (ac + 3bd)/p € Z. Hence also 
3 ((ad — bc)/p)* € Z and therefore (ad — bc)/p € Z, thus k € S. 


LEMMA 4.3. If p is a prime then p € S if and only ifp=3 orp=1 
(mod 3). 


ProoF. If p = a? + 3b?, p 4 3, then b 4 0, so p = a” (mod 3), and 
3 /a; thus p = a? =1 (mod 8). 

Clearly 3 € S. Let p = 1 (mod 3). Since (—3/p) = 1 then 
there exists t such that 0 < t < p/2 and —3 = t? (mod p). Then 
mp=t?+3< (p/2)? +3 <p? so0 <m< p. Now we observe that 
for every t > 1 there exists at most one prime p # 2,3 such that 
p|t?+3 but p /u? +3 for every u,1<u<t. 

Indeed, we assume that there exist distinct primes p,p’ as above, 
p <p’. By the preceding remark, we must have 0 < t < p/2, and 
?+3 = pm with 0 < m < p. Since p’ | #? +3 then p’ | m so 
p' <m <p, which is a contradiction. 

Now we are ready to prove the statement. Suppose there exists a 
prime p, p= 1 (mod 3), such that p ¢ S. We take the smallest such 
prime p. Let t > 1 be the smallest integer such that p | t? + 3, so 
0<t< p/2, t??+3 =mp with 0 < m < p. Ifp’ is any prime dividing 
m,m=p'm’', then p' <m < p,sop’ € S. From p'(pm’) = pm = t?+ 
3 € S it follows from Lemma 4.2 that pm’ € S. If m’ =1 then pe S, 
as we intended to show. If p” is a prime dividing m’, m’ = p"”m", 
then p” < m’ < pso p” € S, hence p’(pm") = pm’ € S and by 
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Lemma 4.2, pm” € S, where m” < m’. Repeating this argument, we 
eventually arrive at p € S. 


It is worth giving another proof, using Dirichlet’s pigeon-hole prin- 
ciple, of the fact that if p= 1 (mod 3) then pe S. 

From p = 1 (mod 3) there exists t, 1 < t < p—1, such that 
—3 =? (mod p). We consider the set of all pairs (m,n) such that 
0 <m, n < [\/p]; since there are more than p such pairs, then there 
exist two distinct pairs (m,n), (m’,n’) such that m+ nt =m’ + n't 
(mod p). Som#m’andn# n’, say n > n’, hence t = (m'—m)/(n 
n') (mod p); since 0 < n— 1! < \/p and 0 < |m'—™m| < \/p, then 
t = +a/b (mod p) with 0 <a < \/p,0<b< \/p. Thus a? + 3b? =0 
(mod p) and we may write a? + 3b? = kp, with 0 < k <4. It follows 
that a? = k (mod 3) hence k = 0 or 1 (mod 3), that is, k = 1 or 3. 
Ifk =1 then pe S. If k = 3 it follows that 3 | a, so a = 3a’ and 
dividing by 3, p= 02+ 3a € S. 


LEMMA 4.4. Let m = u? + 3v”, with u,v £0, gcd(u,v) = 1. If p is 
an odd prime dividing m then pe S. 


PROOF. 3 € S, so we may assume that p 4 3. Since p divides m then 
p does not divide v, otherwise it would also divide u, contrary to the 
hypothesis. Let v’ be such that vv’ = 1 (mod p). So (wv’)? = — 

(mod p) and (—3/p) = 1, that is, p = 1 (mod 3). By Lemma 4.3, 
pes. 


We complete the above lemmas as follows: 


LEMMA 4.5. If p is a prime, p € S, then its representation in the 
form p = a? + 3b? (with a> 0, b> 0) is unique. 


PrRooF. We apply Lemma 4.2 with k = 1, thus p = a?+3b? = c?+3d? 
(where a,c > 0,b>0, d> 0). Hence 


mie 2 2 
eet (“ + =) aa (<— “) 
Pp Pp 


so p = ac+3bd, ad = bc. Therefore pd = acdi3bd? = +bc?+3bd? = 
+b(c? + 3d?) = tbp. Hence d = +b thus b = d, hence a=. 


LEMMA 4.6. Letm = 3 or m= u?+3v?, with u,v 4 0 and gcd(u, v) 
= 1. [fm is odd and m = JJ, ps’ (where pi,... , Pn are primes 
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and e; > 1) then there exist integers a;,b; (i = 1,...,n) such that 
pi = a; + 3b; and 


utovy—-3 = [[ + bjV/—3)%. 
t=1 


ProoF. The proof is by induction on m. It is trivial when m = 3. 
Let m > 3, so m = u* + 3v?, with u,v 4 0, ged(u,v) = 1. Let pbea 
prime dividing m, and m = pk. By Lemma 4.4, p = a? + 3b?, and by 
Lemma 4.2, k = c? + 3d? where c = (ua + 3vb)/p, d = (ub ¥ va)/p 
(with corresponding signs). We also have (a + bV3)(c + dV—3) = 
(ac + 3bd) + (be — ad),/—3 where 


ac + 3bd = —(ua? + 3vab + 3ub? F 3vab) = u, 


1 
+(be— ad) = = ee + 3ub? — uab + va?) = v, 


that is, 


(a+ b/=3)(eF dy/—3) = ut v=. 


If & = 1, it is trivial. If k A 1 then either k = 30rk #3. In 
this case, c # 0 (otherwise c = 0, so d divides u,v, hence d = 1 
and k = 3, contrary to the hypothesis); similarly d 4 0 (otherwise 
d = 0, so c divides u,v hence c = 1 and k = 1, contrary to the 
hypothesis); moreover, gcd(c,d) = 1, because gcd(u,v) = 1. By 
induction, the result is true for k, hence c = d\/—3 is expressible in 
the form indicated. Since (a + b\/—3)(c = d/—3) = u+vy—3 then 
the result also holds for m. 


LEMMA 4.7. Let E be the set of all triples (u,v, s) such that s is odd, 
gcd(u,v) = 1 and s? = u? + 3v?. Let F be the set of all pairs (t, w) 
where gcd(t,w) = 1 andt #w (mod 2). The mapping ®: F — E 
given by ®(t,w) = (u,v, s) with 

u=t(t? —9w’), 

v = 3u(t? — w?), 

s=t?+3u”, 
is onto E. 


ProoF. It is clear that wu? + 3v? = s?. Since t,w have different 
parity, then s is odd. Next we show that gcd(u,v) = 1. Indeed, 
first we note that gcd(t? — 9w?, t? — w?) = 1 because if a prime p 
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divides t? — 9w? and t? — w?, it divides 9t? — 9w? so also 8t?, hence 
p = 2 (since p cannot divide t because gcd(t,w) = 1). Since t,w 
have different parity, this is impossible. Now we assume that p is a 
prime, e > 1 and p® divides u and v then p | t or p | t? — 9w? hence 
p | t in both cases; so p / w(t? — w?) hence p = 3. From 3° | v since 
3|¢ then e = 1 thus gced(u, v) = 1 or 3. If 3] u, 3| v then 3| t, 3 /w 
so 3 | s but 3? /s. However s? = u* + 3v? so 3? | s? hence 3? | s, 
which is a contradiction. This shows that ®(t, w) = (u,v,s) € E. 

Conversely, given (u,v,s) € E, let s*? = []j_, p;' be the decom- 
position of s* into a product of primes (p,,... , Pn distinct, e; > 1); 
so e; = 3e); for every 7. By Lemma 4.6 there exist integers a,, b; 
(¢=1,...,n) such that p; = a? + 3b? and 


ut+vuV/—3 = Ilta + by/—3)% 


i=1 


Let t,w € Z be defined by the relation 


[[ + b,/—3)*% =t+ wV—3, 
t=1 
so u+vy/—3 = (t + wV—3)*. Computing explicitly the cube in the 
right-hand side, it follows that u = t(t? —9w?), v = 3w(t? — w?). Fi- 
nally, A gutens conjugates, u—v\/—3 = (t—w/—3)?, so by multiply- 
ing, 5° = u?+3v? = (t?+3w?)3, hence s = t?+3w?. It follows that t, u 
have different parity and also gcd(t, w) = 1, ®(t,w) = (u,v, s). 


In this way we have established all the steps in Euler’s proof of 
(4A). 

Now we prove the following result due to Kronecker (1859); see 
also Vranceanu (1956, 1960). It is a consequence of Fermat’s theorem 
for the exponent 3. 


(4B) 
(1) For every integer m 4 0 the only solutions in integers of the 
equation 4X° — 3mY? = m® are (m,m) and (m,—m). 
(2) The only rational solutions of 4U* + 27T? = —1 are (—1, 4) 
and (—1,—3). 
(3) X8>-X+ 3 are the only cubic polynomials with rational co- 
efficients such that the sum of roots is equal to 0 and the 


discriminant equal to —1. 
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(4) If the discriminant of a cubic polynomial with rational coef- 
ficients is the sixth power of a nonzero rational number, then 
its roots are of the form r+sv/3sin(7/9), r+sV/3sin(27/9), 
r — sV3sin(47/9). 


PRooF. (1) If2,y are integers and (z, y) satisfies 47° -—3my? = m® 
then letting u = —2z, v = y+m then u? 4+ v? = —8x3 + y? + 3y?m + 
3ym? + y? = —2m? — 6my? + y? + 3y?m + 38ym? + y? = (y—m)?. 
Thus either x = 0 (which would imply —3y? = m?, an absurdity) or 


y = +m; in this case we have necessarily « = m 

(2) Let u,t be rational numbers, such that 4u® + 27? = —1. 
We write u = ain, t = y/3m, so —423 + 3my? = —m?. By (1) w 
have x =m, y = +m, hence u = —1, t=+3 


(3) If X?+aX +6 has rational aoctncieuts and discriminant 
6 = —1, since 6 = 4a® + 27b? then by (2), a= —1, b= +3. 

(4) If f(X) = X%+a,X?+a.X +a has Fational coeflicients, if 
g(X) = f (X — a,/3) then g(X), f(X) have the same aiscnmn inant 
and g(X) is of the form g(X) = X°+uX +t with rational coefficients. 

If the discriminant is a sixth power of a nonzero rational number, 
say —(4u3 + 27#2) = r®, then 4(u/2)* + 27(t/r?)” = —-1. Hence 
= 2074 SP 3, 86 


G(X) = XP -PX+ r =? (2) (=) + 1 : 


The roots of the polynomials 


Xe-X4+ 
are +(2\/3/3) sin(7/9), +(2V3/3) sin(27/9), (2/3/83) sin(47/9), 


hence those of f(X) are of the form indicated. 


wile 


Conversely, in 1944 Schmid established the equivalence below and 
proved Fermat’s theorem for the exponent 3 by showing directly the 
validity of (2); see also Vranceanu (1956, 1960, 1979) where this fact 
is explicitly spelled out: 


(4C) The following statements are equivalent: 
(1) Fermat’s last theorem is true for the exponent 3. 
(2) For every integer m # 0 the only solutions in integers of the 
equation 4X? — 3mY? = m® are (m,m), (m,—m). 
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TABLE 2. FLT for the exponent 3. 


Author Year 
Kausler 1795/6, publ. in 1802 
Legendre 1823, 1830 
Calzolari 1855 
Lamé 1865 

Tait 1872 
Gunther 1878 
Gambioli 1901 

Krey 1909 
Rychlik 1910 
Stockhaus 1910 
Carmichael 1915 

van der Corput 1915 

Thue 1917 
Duarte 1944 


3) The only rational solutions of 4U* + 27T? = —1 are (—1,+), 
(-1,-3) 
S127): 


PROOF. We have seen in (4B) that (1) implies (2) and also that (2) 
implies (3). Now we assume that statement (3) is true and we shall 
derive that Fermat’s theorem is true for the exponent 3. 

Assume, on the contrary, that there exist nonzero, pairwise rela- 
tively prime integers x,y,z such that x? + y? = 23, soy 4 z. Let 


u = a/(y—z),t = (y+ z)/(3(y — z)). Then 4u® + 27t? = —1, 
as is easily seen. So, by assumption, u = —1,t = +1/3, hence 
y—z = +(y+ z); this leads to y = 0 or z = 0, contrary to the 
hypothesis. 


In 1885, Perrin showed that if X? + Y?+ Z? = 0 has a nontrivial 
solution in nonzero relatively prime integers then it would have an 
infinite number of such solutions, which are obtainable from the 
assumed solution by means of rational operations. Of course this 
statement is not of interest since there are no solutions of the type 
indicated. 

Proofs of Fermat’s theorem for the exponent 3 were also published 
by the authors listed in Table 2. 

We conclude this section with the study of an equation similar to 
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(4.1). With the method of infinite descent, we show (see Legendre, 
1808, 1830): 


(4D) For every m > 0 the equation 
(4.3) AP+ Ye =9"7* 


has only the trivial solutions in integers; namely, the solutions are 
(x,y,z) with cyz =0 and ifm =1 also (a,2,x) with any x £0. 


PRooF. Assume that x,y, z are nonzero integers, that x,y, z are not 
equal if m = 1, and that x? + y? = 2™z?; we may assume without 
loss of generality that gcd(, y, z) = ged(x, z) = gced(y, z) = 1. 

If m = 3m! with m’ > 0, (a, y, 2” z) would be a nontrivial solution 
of the equation X? + Y? = Z*, which contradicts (4A). Thus 3 /m. 

Since m # 0 then x,y have the same parity. If x,y are not both 
odd, let s > 1 be the largest integer such that 2° divides x,y. We 
write « = 2°2', y = 2°y' so 2’ or y’ is odd and 2” + y’? = 27-3823. 
thus (2’, y’, z) is a nontrivial solution of an equation of the same type; 
then x’, y’ have the same parity, so both are odd. Thus, changing m 
into m— 3s, we may assume without loss of generality that (4.4) has 
a nontrivial solution (x,y,z) with x,y odd, and also gcd(z, y, z) = 
gcd(x,z) = gcd(y,z) = 1; therefore gcd(x,y) = 1. With the same 
argument, we may also assume that z is odd. So, we have 


wag +y=(x+y)(2? —zy+y’), 


with gcd(a+y, x? -—axy+y’) =1 or 3. Indeed, if p is a prime, e > 1 
and p® divides both z+ y and x? — ry+ y” then x = —y (mod p*) 
so x? —xy+y? = 32? (mod p*); thus p* | 3x7; since gcd(z,y) = 1 
then p x, hence p? | 3, that is p* = 3, proving the statement. 

Moreover, since x? — ry + y? = (x+y)? — 3zy, then 3| r+ y is 
equivalent to 3 | 2? — xy + yz, which is in turn equivalent to 3 | z 
and again to gcd(a+ y, 2? — ry +y’) = 3. 

We are led to two cases. 


Case 1: 3 fz. 
Since ged(a + y, x? — xy + y*) =1 and x? — ry + y” is odd there 
exist odd, relatively prime integers a,b such that 
gty = 2™a3, 
x —syty? =, 
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with z = ab. Since x + y, x — y are even, we may write 


gty\? rv Y 2 
nS 8 
( 2 v ( 2 
It follows from Lemma 4.6 that there exist integers t,w such that 


aru tot = (t + w¥—3)? 


2 
and 
(x +y)/2 = t(t? — 9w’), 
(x — y)/2 = 32(t? — w?), 
b=t? + 3w”. 

If ¢ = 0 or t? — 9w? = 0 then & = -y soz =0. Ifw=0 or 
Pw = 0 then 2 = y = 0org = y= B= Al with m= 1. 
This was excluded by the hypothesis. It follows that |¢? — 9w?| 4 1, 
otherwise t+ 8w = +1, t — 3w = +1 which is easily seen to be 
impossible. 


We have 3 | t, because otherwise 3 | b so 3 | z, contrary to the 
hypothesis. Since } is odd, so are t + 3w and t — 3w. Hence t, t + 
3w, t — 3w are nonzero, pairwise relatively prime integers. From 
2™~'q3 = t(t+3w)(t—3w) it follows that there exist nonzero integers 


c, d,e such that 
bao le. 
t+3w = d3, 


t—3w =e', 


and c,d,e are odd, pairwise relatively prime, with a = cde. Hence 
d? + e? = 2™c? so (d,e,c) is a solution of the given equation. But 
s™—le]3 = |t| < |t| x |t? — 9w?| = 2™"Ja|? hence |c| < jal. Also, 
s™al? = |z+y| < |z+y||2*-zyt+y?| = |2°+y"| = 2™|z|? 80 |e] < |2I. 
Since 3 / c, repeating the argument with the solution (d,e,c) this 


would yield a sequence of solutions (dj), €1,¢1), (dz, €2,¢2),... with 
\z| > |c:| > |c2] > ---, all the c; being nonzero integers, which is 
impossible. 

Case 2: 3 | z. 


Now gcd(a+y, 2?-xy+y’) = 3. We note that 3? /2?-azyt+y? = 
(x + y)? — 3xy; otherwise 3 | xy so 3 divides both x and y, contrary 
to the hypothesis. 
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Thus, there exist odd, relatively prime integers a,b such that 
gpy jo" =< 370%, 
x —ayty? = 3d", 
with 3 /band z = 3ab. Since x+y, 2-y are even and 3 | +y we may 
write 3b? = Saad /2)° + 3((a — y)/2)" hence b? = ((x — y)/2)? + 
3((a + y)/6)°. It follows from Lemma 4.6 that there exist integers 
t,w such that 


not ty = (t + w¥—3)3 


2 

and 
(c—y)/2 =t(t? — 9w”), 
| (a+ y)/6 = 3w(t? — w?), 
b=t?+3u”. 

lft = 0 or #? —9w? = 0 then 2 =y =0 or g = y =z = +1 with 
m= 1. Ifw=0 ort? —w? = 0 then x = —y so z = 0. This was 
excluded by the hypothesis. It follows that |t? — w?| 4 1 otherwise 
t+w= +1, t—w = +1, which is easily seen to be impossible. 


Since b is odd, so are t+ 3w, t—3w, hence also t+w, t—w. There- 
fore w, t — w, t+ w are nonzero pairwise relatively prime integers. 
From 2"~'a? = (a+ y)/18 = w(t — w)(t + w) it follows that there 
exist nonzero integers c,d, e such that 


wa, 
| t+tw =d, 
t—w =e, 
and c,d,e are odd, pairwise relatively prime, with a = cde. Hence, 
d? — e = 2c’, so (d, —e,c) is a solution of the given equation. 
But 2"~"|\c|? = |w| < |w||t? — w?| = |a + y|/18 = 2”~* a], thus 
\c| < ja]. Also 
273? /a|* = |x +y| 
e etylle* = sy+y" 
~ 3 
jz? + y?| 
3 
2r| 2° 
3 p) 


hence |c| < |a| < |z|/3. 
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Whether 3 divides c or not, we repeat the argument in the first 
or second case, and this leads to a sequence of solutions (d;, e1,¢1), 
(dz, €2,C2),... with |z| > |e,| > |co| > ---, all the c; being nonzero 


integers, which is impossible. 


The 


theory of the equation X? + Y? = AZ® has been further 


developed by Legendre (1808, 1830), Pepin (1870, 1875, 1881), Lucas 
(1878, 1880), Sylvester (1856, 1879) and Hurwitz (1917) who proved 
the impossibility of the equation in integers, for many values of A 
— but we shall not enter into this matter. Other interesting papers 
on ternary cubic diophantine equations are from Hurwitz (1917), 
Mordell (1956); see also Mordell (1969). 


1657 


1760 


1770 


1802 


1808 


1823 


1830 
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1.5. The Eisenstein Field 


We shall now give the proof of Gauss that Fermat’s cubic equation 
has only trivial solutions in the Eisenstein field. The set of complex 
numbers a + b,\/—3, where a,b are rational numbers, constitute a 
field, called the Eisenstein field and denoted by K = Q(./—3). The 
numbers a = (a+ b\/—3)/2, where a,b are ordinary integers of the 
same parity, are called the integers of K. They constitute a ring, 
denoted by A. If a,@ € K, G divides a if there exists an integer 
y € A such that a = By. We write 3 | a when £ divides a. Two 
nonzero integers a, are associated if a divides G and £ divides 
a; we write a ~ (@. The integers associated with 1 are called the 
units of K. It is easily shown that they are +1, +¢, +¢?, where 
¢ = (-1+ V—3)/2, ¢? = (-1— V—3)/2. We note that ChE that 
is, ¢ is a primitive cubic root of 1, and 1+ ¢+¢? = 0. A nonzero 
integer a € A is a prime if it is not a unit and the only integers 
dividing a are units or associated with a. 

In the particular field Q(\/—3) under consideration, it is true 
that every nonzero integer a is the product of prime integers: a = 
1Y2°':Ys- This decomposition is unique, in the following sense: if 
we also have a = 6,62---6,, where each 6; is a prime of K, then 
s = t and, changing the order if necessary, 7; and 6; are associated 
(for every i = 1,...,s). Therefore, we may define, in the obvious 
way, the greatest common divisor of nonzero integers of A which is 
unique up to units of A. 

The proofs of the following properties may be found in any stan- 
dard text on algebraic numbers, for example, in Ribenboim (1999). 
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The conjugate of a = (a+ b/—3)/2 is @ = (a — b\/—3)/2. The 
norm of a is N(a) = a@ = (a — 3b*)/4. 

If a€ A then Aa = {Ga | 3 € A} is the ideal of multiples of a. If 
a,B,y € A, a #0, we write 


B=y7 (mod a), 


when a divides G — y; we say that G and y are congruent modulo 
a. This is an equivalence relation on the ring A and the set of 
equivalence classes is denoted by A/Aa; the equivalence class of 3 
is denoted by 6 and called the residue class of 3. We define the 
addition and multiplication of residue classes as follows: 6 +7 = 
B+, B-¥ = by. Then A/Aa is a ring, called the residue ring of A 
modulo a. The residue ring A/Aa is finite; its number of elements 
is equal to |N(a)|. 

Now we describe the decomposition of prime numbers p as prod- 
ucts of prime elements of the ring A. 


(1) p = 3 is ramified, that is, 3 = (—¢*)A*, so 3 ~ A’, where 
A} =1-¢ = (3— V—8)/2, 2 is a prime element of A. There are 
three residue classes of A modulo ); the set {0,1,—1} is a system of 
representatives of the field 4/A\. The norm of » is N(A) = AA = 
(1-—¢)(1-¢7) =1-¢€-@41=3, since 14+¢6¢+C=0. 

(2) p = 2 is inert, that is, 2 is a prime of A. There are four residue 
classes of A modulo 2, that is, A/A2 is the field with four elements; 
the norm of 2 is N(2) = 4. 

(3) If p = 1 (mod 8) then p ~ A,A2, where 21, Az are prime ele- 
ments of A which are not associated (A; % A2); we say that p splits 
(or is decomposed). Now A/Ap has p” elements and it is the direct 
product of two copies of the field F,, with p elements, and 


N(A1) = N(A2) = P 


(4) If p = —1 (mod 3) then p is a prime element, that is, p is inert; 
A/Ap is a field with p? elements, N(p) = p’. 


We shall not need (2), (3), (4) above in Gauss’ proof. 
We shall need the following precise congruence: 


LEMMA 5.1. If a € A and X does not divide a then a® = +1 
(mod A‘). 
PROOF. Since a # 0 (mod A) then a = +1 (mod A). First, we 


assume a = 1 (mod X), soa—1= BA where G € A. Then a—¢ = 
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(a1) 6) = BNA AS ABP yaa Sa =o) CS 
AB +1)+¢CA = A(B— ¢?). Hence a* — 1 = (a—-1)(a—-¢)(a—¢?) = 
°B(6+1)(6-—¢?). But 1—¢? = (1+ ¢)A, or ¢? = 1 (mod X). Hence 
GB, G+1, G—C? are in three different classes modulo X, and at least one 
is a multiple of A. Therefore a? = 1 (mod A‘). If a = —1 (mod 4) 
then —a? = (—a)? = 1 (mod A*), so a? = —1 (mod 4%). 


The following result of Gauss implies (4A): 


(5A) The equation 
(521) X°+Y%+ 7 =0 


has no solution in algebraic integers of Q(V—3), all different from 
0. 


Proor. Assume that €,7,6 € A are nonzero and satisfy €?+73+6° = 
0. If gcd(&, 7, 0) = 6 then €/6, 7/6, 0/6 satisfy the same equation 
and gcd (€/6, 7/6, 0/5) = 1. So we may assume gcd(,7,0) = 1 and 
therefore, £, 7, @ are pairwise relatively prime. So A cannot divide 
two of these elements €, 7, 6. We may assume, for example, that 


ALE, A Jn. 
First Case: We assume that \ /0. 


Then 
63 
63 (mod 43), 


so0=€47°+0=+1 1 (mod \3). The eight combinations 
of signs give +1 or +3. These are congruent to 0 modulo 3, since 
+1 are units, +3 are associated with A”, hence not multiples of A°. 


(mod \°), 
(mod \°), 


TT 
— or 


ee 


Second Case: We assume that A | 0. 
Let 0=A™y, w € A, m > 1, and A does not divide w. The essen- 
tial part of the proof consists in establishing the following assertion: 
Let n > 1, and let ¢ be a unit of A. If there exist a,@,y € A, 
pairwise relatively prime, not multiples of A, and a?+3+e\3"73 = 0, 
then: 
(a) n > 2; and 
(b) there exist a unit ¢, and aj,(1,y1 € A, pairwise relatively 
prime, not multiples of A, such that a? +63 +¢,49°"") 43 = 0. 
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The hypothesis is satisfied with n= m,e=1,a=£& B=, 7 
w. By repeated application of the above assertion, we would find a 
unit <’, and a’, ’,7/ € A not multiples of \, such that a’ + 6 + 
e'\?4/* = 0, and this contradicts (a) above. 

First we show that n > 2. Indeed, A /a and A /G. So by Lemma 
5.1, a3 = +1 (mod 4‘), 6? = +1 (mod A*) and +1 +1 = —eA°"y 
(mod A*), A /y. Since 7 / +2 the left-hand side must be 0. From 
dy we conclude that 3n > 4, so n > 2. 

Now we prove (b). We have 


(5.2) —ed"% =a0° + B = (at B)(a+CB)\(at C78). 


Since 4 is a prime element dividing the right-hand side, then it must 
divide one of the factors. But a+ 6 =a+¢8=a+¢°6 (mod X) 
because \ = 1—¢, 1— ¢? = —¢?A so \ must divide all three factors; 
hence (a+ 3)/A, (a+ ¢8)/A, (a+ €?B)/rA € A and 


-etn (28) (8D) (242) 


Since n > 2, divides the right-hand side, hence at least one 
factor. It cannot divide two of the factors, otherwise two among 
a+, a+¢B, a+C¢? are congruent modulo \?. We check that this is 
not possible: (a+3)—(a+¢3) = B(1—¢) = BA = 0 (mod 4?) implies 
A | @, a contradiction; (a+ 8) —(a+¢?B) = B(1—¢?) = —BC7A =0 
(mod 7) implies \ | @ again; (a + CB) — (a+ ¢?B) = CB(1—¢) = 
¢3X = 0 (mod A?) implies 2 | 6 again. 

Let us assume that A divides (a+ 3) /A (the other cases are treated 
by replacing 3 by ¢G or ¢78). Then \3~-) divides (a+3)/X. There- 
fore 


(5.3) at+¢68 = Ako, 


a+ C78 = AkK3, 


with &1,K2,K3 € A, A not dividing kK, K2, k3. Multiplying, we have 


| at B= °°? ka, 


(5.4) — ey? = Ky Kok. 


We note that «1, K2, 3 are pairwise relatively prime. For exam- 
ple, if 6 € A divides «,, Kz, then 6 divides (a + 8) —(a+¢8) = 
B(1 — ¢) = BA, and similarly when 6 divides &,, «3 (or K2,K3). But 
does not divide K1,K2,K3, so 6 is not associated with A; hence 6 
divides @ and therefore also a, which is a contradiction. 
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By the unique factorization in the ring A, it follows from (5.4) that 
the elements #1, K2, K3 are associated with cubes, i.e., there exist 
units w; € A and elements yu; € A such that K; = wy? (4 = 1,2,3). 
So 


(5.5) a+¢B = Apbwe, 
at 673 = rp3w3. 


We note again that p41, fo, p43 are pairwise relatively prime and X 
does not divide p41, U2, “3. Thus 


0 = (a+ 8)+¢(a+¢8)+ C(a+¢78) 
= "7 han + CApaw2 + CAUaWs, 


| a sty B — peed oe 


sO 
13 + THs +MY et = 0, 
where 7, 7’ are units, /11, Jl2, U3 € A are not zero, and gcd(p2, u3) = 
1. If r = 1, we have established (b). If 7 = —1, we replace ju3 by 
—p3 and have again shown (b). To complete the proof we show that 
the unit 7 cannot be equal to +¢ or + ¢?. In fact, w3 + TH} = 0 
(mod A”). Since w3 = +1 (mod A*), w3 = +1 (mod A‘) then p3 + 
TH3 = +147 = 0 (mod 47). However, +1 + ¢ #0 (mod \?) and 
1+¢? 40 (mod A?), so 7 ¢, +¢?, and the proof of (b) is now 
complete. 
As already explained, this suffices to prove the theorem. 


We take this opportunity to indicate some results similar to (5A) 
that may be proved with the same methods. They may be attributed 
to Euler and Legendre. 


(5B) Let p be a prime, p= 2 or 5 (mod 9). Ife is a unit of K, 
if there exists x € A such that x3 =e (mod p), then e = +1. 


Proor. Assume that ¢ = +¢ or £¢? and that there exists x € A such 
that x? =e (mod p). Since p = 2 (mod 3) then p is a prime element 
of A, A/Ap is a field with p? elements, and x”°~! = 1 (mod p). But 
p?—-1=(p+1)(p—1) = 3(r +1)(8r +1), where p = 3r + 2. Hence 


2 
r+1 eft) Br+1) = 7? 1a 1 


E (mod p). 


If r =0 or 1 (mod p) then p divides 1+¢ or 1+ ¢?. Noting that 
1-—¢? = (14+¢)(1—¢) and that 1+¢€ =1-¢?, 1+? = —¢ are units, 
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then p divides 1—¢. This would imply that p = 3, which is contrary 
to the hypothesis. Hence r = 2 (mod 3) and p = 8 (mod 9), which 
is again a contradiction. 


Following Mordell, we prove the classical result: 


(5C) Let p be a prime, p= 2 or 5 (mod 9), and let € be a unit of 
Q(¢). The equation 


(5.6) X°+Y% + caZ? =0, 
with a = p or p?, a # 2, has only the trivial solution (x,y,z) in Q(C), 


namely z = 0, x = —y, or —Cy, or —C?y. If a = 2 then there are 
also the solutions x? = y? = z2 = +1 when e = —1. 


Proor. To begin we note that if (x,y,z) is a solution with x = 0 
or y = 0 then necessarily z = 0. Indeed, if both x,y are 0 then so 
is z. If, for example, y 4 0 and ¢ £4 0 we may assume gcd(y, z) = 1. 
Then p divides y, so p? | az®, hence p | z, a contradiction. 

Assume now that x,y,z € A are such that 7°+y?+eaz? = 0, with 
x,y,z #0. We may also assume that gcd(z, y, z) = 1 from which it 
follows that x, y, z are pairwise relatively prime. 

Among all possible solutions, consider one for which the absolute 
value of the norm |N(xyz)| is minimum. Note that 2° + y® 4 0 since 
z#0;so x4 —y, —Cy, —C?y. 

Consider the Lagrange resolvents 


a=xr+y, 

B= Cat Cy, 

y= Cat Cy. 
Then a, 6,7 € A, a, 6,7 = 0 and 


aBy = (a? + ¢y? + (1+ day) (Ca + y) 
= O(¢a? + Cy? + (1464 C)a?y t+ (1464 C) zy’) 
= e+ y° = -caz*. 


Let 6 = gced(a, GB, 7) € A, so ged(a/6, 3/6, y/6) = 1 and 


a oem C. 
a ee ek 
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Hence a divides one and only one of the factors in the left-hand side, 
say 7/6 (the other cases are similar). By the unique factorization 
theorem, which is valid in the ring A, we have 


a/6 = €,0%, 
B/é = £237, 
7/6 = e307, 


where €1,€2,€3 are units of A, and a,,0,,y € A, a1,01,% 4 0, 
gcd(ay, 31, 1) = 1. Hence 


6108 + 6933 + e3a7; = 0. 
Let é! = €9/€,,€” = €3/€1; hence 
(5.7) at + eB + e"ayi =0. 


We have p / 3; otherwise p | a1, so p® | a7y1°, hence p | 7 and this 
is impossible because gced(aj, (1,71) = 1. Taking the classes modulo 
p, (5.7) yields 

aj +3; =0 (mod p), 
hence ¢’ is a cube modulo p. By (5B), e’ = +1, so 


a3 + (+6,)° + eay; =/0 


and (a,, +1, 71) is another nontrivial solution of (5.6), with gcd(a,, 
+01, ¥,) = 1. By assumption, 


Nava) < INCaBrn)I =|v(S7)| =| (4) 
hence |N(éxy)| < 1. This implies that x,y,6 are units. Hence x? = 
+1, y2 = +1, and +1+1+caz3 =0. 

If a~2 then z = 0, a contradiction. 

If a = 2, we must have either z = 0, or x? = y? £0, x? +ez* =0; 
then € is a cube modulo p, therefore by (5B), ¢ = +1. But clearly 
€ = +1 would imply z = 0, soe = —1 and 2? = 7° = 22 = 41. 


3 


o] 


As an immediate corollary we have: 
(5D) The equation 


(5.8) X34 4¥% =1 


has no solution in nonzero integers. 
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PROOF. Consider the equation 
(5.9) X34 77 +4Y? =0. 


If (x, y) is a nontrivial solution of (5.8) then (2, —1, y) is a nontrivial 
solution of (5.9), which is impossible, by (5C). 


We also note other consequences: 


(5E) = The equations 


(5.10) KX 27TY? = 97", 
(5.11) X® —16 x 27Y® = Z?, 
(542) 16X° —27Y° = 2’, 


have no solutions in nonzero integers. 


Proor. Assume that x° — 27y° = 223, with x,y, z nonzero integers. 
Then (x7)? + (—3y?)? — 227 = 0. By (5C), 2° = y® = z® = +1 and 
therefore x = y = z = 1, which is impossible. 

If 2° — 16 x 27y° = 23, then multiplying with 2° we have (2x7)? + 
(—2z)° — 2(2? x 3y”)* = 0. By (5C), (2x7)? = (—2z)? = (2? x 3y”)® = 
+1 and therefore 2x? = +1, a contradiction. 

Finally, if 162° — 27y° = z® then (3y?)? + 2? — 2(2x)? = 0, so 
by: 6C), (397)" = 2" = (207)? = 241, thus 247 = 5,6. contradic- 
tion. 


Furthermore, Legendre showed: 
(5F) The equation 
X°4 Y% = 378 


has no solution in integers different from zero. 


PROOF. The proof may be conducted following the same lines, with 
appropriate changes. 


In 1856, Sylvester also announced: 
(5G) The equation 
X°+V°4+Z77+6XYZ=0 


has no solution in integers different from zero. 
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1.6. The Quintic Equation 


The case n = 5 was first settled by Dirichlet. His paper was read at 
the Academy of Sciences of Paris in 1825, but his proof, published in 
1828, did not consider all the possible cases. Legendre then published 
a complete and independent proof, while Dirichlet was able to settle 
the last remaining case. We reproduce Dirichlet’s proof in modern 
language, using a few facts about the arithmetic of the quadratic field 
K = Q(v5). The proofs may be found, for example, in Ribenboim 
(1999). 

Let A be the ring of integers of Q(V/5). The elements of A are of 
the form (a+bV/5)/2, where a, b are integers of the same parity. The 
invertible elements of A, i.e., the units of K, form a multiplicative 
group. (a+ bV5)/2 is a unit if and only if its norm 


(*) (*) _ a’? — 5b? 


2 2 4 


is equal to +1 (i.e., a? —5b? = +4). It may be shown that the units of 
A are precisely the elements +((1 + V5)/2)°, where e is any integer. 

An important fact required in the proof is that every element 
of A may be written (up to a unit) in a unique way as a product 
of powers of prime elements. Or, equivalently, every ideal of A is 
oe Among the prime elements of A there are the numbers 
2, V5. 
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We begin by establishing a property concerning certain principal 
ideals of A which are fifth powers. 


(6A) 
(1) Let a,b be nonzero integers such that gcd(a,b) = 1, a # b 
(mod 2),5)/a,5 | b. If a? — 5b? is the fifth power of an 
element of A then there exist nonzero integers c,d such that 


(6.1) { a = c(ct + 50c?d? + 125d"), 


b = 5d(c* + 10c?d? + 5d‘), 
and gcd(c,d) = 1, c#d (mod 2), 5 Jc. 
(2) Let a,b be integers such that gcd(a,b) = 1, a,b both odd, 
5 Va, 5 |b. If (a® —5b?)/4 is the fifth power of an element of 
A, then there exist nonzero integers c,d such that 


(6.2) b = 5d(c! + 10c?d? + 5d*)/16, 


and gcd(c,d) = 1, c,d are both odd, 5 Jc. 


{ a = c(c* + 50c?d? + 125d*)/16, 


ProoF. If the nonzero integers c,d satisfy (6.1) (respectively, (6.2)) 
then gcd(c,d) = 1,5) c, c,d cannot be both odd, otherwise a,b 
would be both even (respectively, c,d cannot have different parity, 
otherwise 16 would divide c and d, so c,d must be odd). 

Now we prove the existence of c,d, in both cases. 

(1) We first observe that if a+byV/5 = ((h+kv5)/2)> withh =k 
(mod 2), then h,k are even. Indeed, 2°b = 5k(h* + 10h?k? + 5k*), 
so 2° divides h* + 10h?k? + 5k*. If h,k are odd then k = +1, +3 
(mod 8), so h? = 1,9 (mod 16), h* = 1,17 (mod 32) and similarly 
for k, k?, k*+. Hence h* + 10h?k? + 5k* is congruent modulo 32 to 
either 


Ss oe. DOS ie. 
Po OS ok ={ 1+90+85 =16, 


or 
17+26+5 =16, 


2 ra 
He eOM crak ={ 17 + 234485 = 16, 


which is a contradiction. 

Now we show that ged(a+ bV5, a — b\V/5) = 1. In fact, if a prime 
element a € A divides a + bV5 and a — bV5 then a divides 2a and 
2bV/5. If a| V5 then a = V5w (w a unit of A); so V5 divides 2a,5 
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divides 4a? (in Z), and 5 | a, contrary to the hypothesis. Thus a 
divides 2b; since there exist integers s,t such that 2as + 2bt = 2 
then a would divide 2. But 2 is a prime in A, then a = 2w (wa 
unit of A). So 2 divides both a+ b/5 and a— by5, hence 4 divides 
a? — 5b? = (a+ bV/5)(a — by/5). But a,b have different parity, so 
a? — 56? is odd, a contradiction. 

Since gcd(a + bV5, a — b\V/5) = 1 and a? — 5d? is the fifth power 
of an element of A, it follows from the unique factorization in A 
that a+ by/5 is the fifth power of an element, say a + bV/5 = ((m+ 
nv/5)/2)°, where m =n (mod 2). Hence 


sen (25) (298) 


2 2 


where t = u (mod 2), (t+uv/5)/2 is a unit of A, and so t?—5u? = +4. 

Let ((m+nvV/5)/2)> = (m'+n'vV/5)/2 so that 16m’ = m> (mod 5), 
16n’ = 0 (mod 5) and hence 5 | n’. Also 4a = m’t + 5n’u, 4b = 
m'u+ n't; hence 5 /'m’ (otherwise 5 | a, contrary to the hypothesis) 
and therefore 5 /m. Since 5| n’,5| 6 then 5| m’u so 5|u. If u=0 
then ¢ = +2 and a+ bV5 = +((m+ nV/5)/2)°. From the remark at 
the beginning, m,n are even and we put c = +m/2, d= +n/2 and 
it is clear that c,d satisfy the relation (6.1). 

If uA~ 0 then (t + uV/5)/2 = +1 and hence (t + uV5)/2 = +((1 + 
V5) /2)° with some exponent e 4 0. Replacing, if necessary, (1 + 
V5) /2 by its inverse, —(1— V5) /2, we may assume e > 0 and actually 
e > 1 (otherwise u = +1 contrary to the fact that 5 | wu). Then 
Be aia + ud) = (1 a's V5); therefore 


stu=e45()) +5(5) Ss ea 


so 2°-'u = +e (mod 5) and since 5 | u then 5| e. Thus e = 5f. Let 


m+nvJ5 (1495) Jd+dV/5 


2 2 “7 — 2 
where c! = d' (mod 2). Then a+ bV5 = +((c + d’vV5)/2)°. By the 
remark at the beginning, c’,d’ are even. Let c = +c'/2, d = +d'/2. 
Then c,d satisfy relations (6.1). 


(2) The proof in this case is very similar, so we only indicate the 
main steps. First we prove that gcd((a + bV5)/2, (a — bV5)/2 = 1, 
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hence (a + by/5)/2 = ((m + nv/5)/2)?((t + uV5)/2) with m = n 
ee 2),t =u (mod 2), t? —5u? = +4. Then 5 | u. If u = 0 let 
+m, d= +n, so relations (6.2) are satisfied. If u # 0 then 
(t + u5) /2 = +((1 + V5)/2)* with e > 0 (actually e > 1). Then 
e = 5f, and letting 


dp. (nam ( avs) c+dvV5 
a i 9 b) 


2 2 


it follows that (a+ b/5)/2 = ((e + dV5)/2)° and again the relations 
(6.2) are satisfied. 


(6B) The equation 
(6.3) X°+Y°+Z7°=0 
has no solution in integers all different from 0. 


PROOF. We assume that there exist nonzero integers x, y, z such that 
x+y? +2? =0. We may assume that gcd(z,y,z) = 1 and hence 
x,y,z are also pairwise relatively prime. 


First Case: 5 does not divide xyz. 

Then x, y, z are congruent to +1 or +2 (modulo 5). Since 2° = x 
(mod 5), y° = y (mod 5), 22 =z (mod 5) thenrz+y+z2=2°+y?+ 
2° =0 (mod 5). 

If x,y,z are pairwise incongruent modulo 5 then «+y+z2 #0 
(mod 5). So, for example, x = y (mod 5). Then —z = a+ y = 22 
(mod 5). Raising to the fifth power, x° = y® (mod 5*), —z° = 2°2° 
(mod 5”) hence also —z°? = 2° + y® = 22° (mod 5?) and therefore 
2°x° = 2x° (mod 5”), so 2° = 2 (mod 52), which is not true. This 
proves the proposition in the first case. 


Second Case. 5 | z (for example). 

Then 5 /ay. Since gced(x, y) = 1 then either x,y are both odd or 
of different parity. 

We first consider the case where x,y are odd. Then —z° = 2° +y/° 
is even, so 2,5 divide z. We may write z = 25"z', with m>1,n> 
1, z’ not a multiple of 2 or 5. Replacing z by z’, we have nonzero 
integers x, y, Z, pairwise relatively prime such that 


(6.4) —2° 5 = oP +y?, 
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with x,y,z odd and not multiples of 5,5 m>1,n>1. Let r+y= 
2p, x — y = 2q (p,q integers not equal to 0) sot =p+q,y=p-Q, 
with gcd(p, q) = 1 and p,q not both odd. Then 


= 25" 29 = (p +g)? + (p— gq)” = 2p(p* + 10p?q” + 54’). 


Since 5 | p or 5 | p* + 10p?q? + 5q* it follows that 5 | p and we write 
p=5r. So 5 /q and gcd(r,q) = 1, g,r having different parity. Thus 


2°59" 2? = 2 & bi e(q™ + 50g?” 1.1257"). 


Let t = q* + 50q?r? + 125r* = (q? + 25r?)? — 5(10r?)?. We put u = 
q’ + 25r?, v = 10r?, so u,v are not 0,u is odd, 10 | v, gcd(u, v) = 1. 
Then t is odd, 5 /'t, gcd(t,r) = 1, so 5 | r (since 5n > 2). 

Since ged(2 x 5?7r, t) = 1 then 2 x 5’r and t are fifth powers of inte- 
gers. But t = u?—5v? with u # v (mod 2), ged(u,v) = 1,5 Yu, 5 | v. 
By (6A), there exist nonzero integers, c,d such that 


u = c(c* + 50c?d? + 125d‘), 
v = 5d(c* + 10c?d? + 5d*), 


and gcd(c,d) = 1,c # d (mod 2), 5 /c. From this, it follows that 
5 | d since 5 | r thence 5? | v. We also note that d > 0. Multiplying 
the last relation by 2 x 5° we have (2 x 5’r)? = 2 x 5° x 10r? = 
2 x 5*d(c* + 10c?d? + 5d*) and this number is a fifth power (since 
2 x 5°r is a fifth power). 

But gced(2 x 54d, ct + 10c?d? + 5d*) = 1 because c* + 10c?d? + 5d? 
is odd, 5 /c and gcd(c,d) = 1. Hence 2 x 54d and c*+ 10c?d? + 5d* = 
(c? + 5d?)? — 5(2d?)? are fifth powers. Again c? + 5d?, 2d? are not 
both odd, gced(c? +5d?, 2d?) = 1, 5 /c?+5d?, 5 | 2d”. By (6A), there 
exist nonzero integers c’, d’ such that 


+5 =c(c* + 507d? + 125d’), 
2d? = 5d’(c* + 10c7d” + 5d’), 
and gcd(c’,d’) = 1, ¢d 4d’ (mod 2), 5 /c’. From this it follows that 
5 | d’ because 5? | d?. We also note that d’ > 0. Multiplying the last 
relation by 2 x 5° we have 
2? x 58d? = (2 x 54d)? =2 x 5°d'(c* + 10c7d"* + 5d’), 


and this number is a fifth power. Since ged(2 x 5°d’, 841007 d? + 
5d'*) = 1 then 2x 5°d', ¢* +10¢7d’” + 5d” are fifth powers. This is 
analogous to the previous assertion that 2 x 5*d, c+ + 10c?d? + 5d?* 


54 I. Special Cases 


were also fifth powers. Moreover, 0 < d’ < d, because 25d” < 
5d'(c* + 10c’*d’? + 5d") = 2d?, so 0 < d! < 5,/(2d?)/25 < d. If this 
procedure would continue, we would reach an integer d” such that 
0 < d’ <1 and this is absurd. 

It remains to consider the case where x and y are of different 
parity. We omit some details of the computations. 

Let e+ y = p,x—y = q 80 p,q are odd, gced(p,q) = 1, 2a = 
pt+q, 2y=p—q. Then —2° x 5°%z° = (22)? + (2y) = (pt q)? + 
(p — q)? = 2p(p* + 10p?q? + 5q*). Since 5 | p we write p = 5r, so 
5 /q, gcd(q,r) = 1, g,r are both odd, 


0) SebeN gh = 2 xe bert 


where t = q*+50q?r7+125r* = u?—5v? with u = q?+25r?, v = 10r?. 
Then u,v are not 0, u,v are even, u = 2 (mod 4), 5 Jt, gcd(t,r) = 
1 so 5 | r (since 5n > 2). We write u = 2u’,v = 2v’, u’,v’ are 
odd, gcd(u’,v’) = 1,5) w,5 |v’. If t = t/4 = wu? — 5v” then 
t’ = 0 (mod 4), and —5°"2° = 5rt’/4, with gcd (5?r, t’/4) = 1. So 
5?r and t’/4 = (w’* — 5v’”)/4 are fifth powers. By (6A) there exist 
nonzero integers c,d such that 


u’ = c(ct + 50c?d? + 125d") /16, 
uv’ = 5d(c* + 10c?d? + 5d*) /16, 


and ged(c,d) = 1, c,d are both odd, 5 /c. Moreover, since 5 | r then 
5? |v’ so 5 | d. We note also that d > 0. 
Multiplying the last relation by 5° we have 


(2480 sa 


where ((c? + 5d?)/2)” — 5d* = 0 (mod 4). Since the two factors in 
the right-hand side are relatively prime and (57r)? is a fifth power, 
then 54d and }[((c? + 5d?) /2)? — 5(d?)?] are fifth powers. By (6A), 
there exist nonzero integers c’,d’ such that 


54d 
4 


(5?r)? = 53! = 


(c? + 5d?) /2 = (c* +.50c7d” + 125d") /16, 
d? = 5d'(c* + 10¢7d’ + 5d’) /16, 


with gcd(c’,d’) = 1, c,d’ both odd, 5 / c. Moreover, 5 | d’ and 
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TABLE 3. FLT for the exponent 5. 


Author Case Year 

Gauss both 1863 (posthumous publication) 
Schopis first 1825 

Lebesgue both 1843 

Lamé both 1847 

Gambioli both 1901 and 1903/4 
Werebrusow both 1905 

Mirimanoff first 1909 

Rychlik both 1910 

Hayashi? both 1911 

van der Corput both 1915 

Terjanian both 1987 


d' > 0. Multiplying the last relation by 5° we have 


9 
582 = (54d)? = “s (c* + 10¢7d? + 5d’) 
59d’ | {c? +5d?\" ny 
a ( : 5(d’”)?| 


Again 5°d' and 4{((c’? + 5d’) /2)? — 5(d’”)”] are fifth powers. This 
is analogous to the previous assertion. Moreover 0 < d’ < d, because 
25d’? < 16d?. The continuation of this procedure would lead to a 
contradiction. 


In 1912, Plemelj proved the following extension of the preceding 
theorem, see also Nagell (1958): 


(6C) The equation 
X°+Y°4Z°=0 


has only trivial solutions in integers of the number field Q(v5). 


Other proofs of Fermat’s theorem for the exponent 5 are given by 
the authors in Table 3. 


?This proof may be incorrect, according to private communications. 
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1825 


1828 


1830 


1843 


1847 


1875 


1901 


Bibliography 


Schopis, Einige Sdtze aus der unbestimmten Analytik, Progr. 
Gummbinnen, 1825, pp. 12-15. 

Dirichlet, G.L., Mémoire sur l’impossibilité de quelques équa- 
tions indéterminées du 5° degré, J. Reine Angew. Math., 3 
(1828), 354-375; reprinted in Werke, Vol. I, pp. 1-20 and 
21-46, G. Reimer Verlag, Berlin, 1889, and also by Chelsea, 
New York, 1969. 

Legendre, A.M., Théorie des Nombres (3° édition), Vol. I, 
p. 5, Firmin Didot Fréres, Paris, 1830; reprinted by A. Blan- 
chard, Paris, 1955. 

Lebesgue, V.A., Théorémes nouveaux sur l’équation indéter- 
minée x? +y° = az°, J. Math. Pures Appl., (1), 8 (1843), 
49-70. 

Lamé, G., Mémoire sur la résolution en nombres complexes 
de l’équation A® + B° + C® =0, J. Math. Pures Appl., (1), 
12 (1847), 137-171. 

Gauss, C.F., Zur Theorie der complexen Zahlen. (1) Neue 
Theorie der Zerlegung der Cuben, Werke, Vol. II, pp. 387- 
391, Konigl. Ges. Wiss. Gottingen, 2nd ed., 1875. 
Gambioli, D., Memoria bibliografica sull’ultimo teorema di 
Fermat, Period. Mat., 16 (1901), 145-192. 


1903/4 Gambioli, D., Intorno all’ ultimo teorema di Fermat, I 


1905 


1909 


1910 


1911 


1912 


1915 


Pitagora, 10 (1903/4), 11-13 and 41-42. 

Werebrusow, A.S., On the equation x° + y°® = Az® (in Rus- 
sian), Moskov. Math. Samml., 25 (1905), 466-473. 
Mirimanoff, D., Sur le dernier théoréme de Fermat, Enseign. 
Math., 11 (1909), 49-51. 

Rychlik, K., On Fermat’s last theorem for n = 5 (in Bo- 
hemian), Casopis Pést. Mat., 39 (1910), 185-195, 305-317. 
Hayashi, T., On Fermat’s last theorem, Indian Math. Club, 
Madras, 3 (1911), 111-114; reprinted in Science Reports, 
Tohoku Imp. Univ., (1), 1 (1911/12), 51-54. 

Plemelj, J., Die Unlésbarkeit von 2° + y° + 2° =0 im Korper 
k(/5), Monatsh. Math., 23 (1912), 305-308. 

van der Corput, J.G., Quelques formes quadratiques et quel- 
ques équations indéterminées, Nieuw Archief Wisk., 11 (1915), 
45-75. 


1.7. Fermat’s Equation of Degree Seven 57 


1958 Nagell, T., Sur l’équation 2° +y° = z°, Arkiv. Mat., 3 (1958), 
511-514. 

1973/4 Terjanian, G., L’équation «" + y" = 2" pour n = 5 et 
n = 14, Sém. Th. des Nombres, Bordeaux, 1973/4, exp. no. 
5, 6 pp. 

1987 Terjanian, G., Sur une question de V.A. Lebesgue, Ann. Inst. 
Fourier, 37 (1987), 19-37. 

1999 Ribenboim, P., Classical Theory of Algebraic Num bers, Sprin- 
ger-Verlag, New York, 1999. 


1.7. Fermat’s Equation of Degree Seven 


In 1839, Lamé proved Fermat’s theorem for the exponent 7. Lebes- 
gue found a simpler proof in 1840. Genocchi devised in 1874 and 
1876 a still simpler proof (using an idea already found in Legendre 
(1830)), which is reproduced in Nagell’s book (1951): 


(7A) 
(1) If x,y,z are the roots of a cubic equation with coefficients 
in Q, and x? + y" + z7 = 0 then either xyz = 0 or x,y,z 
are proportional (in a certain order) to the cubic roots of 1, 
namely 1,¢ = (—1+ V—3)/2, @? = (-1— V—3)/2. 
(2) The equation x’ + y’ +2" =0 has only the trivial solution in 
integers. 


PROOF. (1) Suppose that x, y, z are the roots of f(X) = X?—pX?+ 
qX —1r, with p,q,r € Q. Then 


pH=@L+TytZ, 
q=tY + LZ+ YZ, 
p= Dye 
Case I: p = 0. 

We use the identity (see Section II.5) 


(X+Y)'—XT-Y7=TXYV(X+Y)(X°4+ XV 4+"). 


Ifa+y+z =Oand x’ +y’+27 = 0, then 7ry(x+y)(x?+ry+y’)? = 0. 
Hence either x = 0, or y = 0, or z = —(a + y) = 0, or ryz £ 0, 
but 2? + zy + y*? = 0. Therefore (y/x)? + y/z + 1 = 0 and so 
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y/x is a cubic root of 1,y/x #1. Thus y = x¢ (or y = «¢?) and 
z= —(x+y) = —-2(14+¢) = 2¢?, so x, y, z are proportional to 1, ¢, ¢? 
(or to 1,¢7,¢). 


Case II: p £ 0. 
Let k > 1 and s, =2*+y*+2*, the sum of the kth powers of the 
roots of f(X) =0. By Newton’s formulas: 


$1 =P, 

82 = 8p — 2q, 

83 = Sp—siqt3r, 

S4 = 83p — Sot S17, 

85 = S4p — 834 + S27, 

86 = S5P — 84d + 837, 

87 = S6P — 85q + Sal. 
Substituting we obtain 
a ty +2" =p" —Tp’qtTp'r + 14p3¢? — 21 p*qr —Tpq? + Tpr? + 7¢’r. 
Let m= pq—r€Q. Then 


a ty’ +2" =p! —7(pq—r)(p* — pg tq”) + T(pq — 1)*p, 
that is, 
a +y’ +2" =p! —Tm(p* — pq tq’) + 7m’p. 
Since x’ + y’? + z7 =0 then 
p' —7m(p* — p’q + q’) + 7m?p = 0. 
Let qg/p? = Q, m/p? = M. Then 
p' — Tp M(p* — p*Q — p*Q?) + Tp'M? = 0, 


SO 


M?—- M(1-Q+Q?)+4=0. 
Since M is a rational number, the discriminant ((1—Q+Q?) /2)?- 
; is the square of a nonzero rational number. Let 2Q—1 = s/t, where 
s,t are relatively prime integers, t > 0. Then Q = (t+ s)/2t, hence 
64¢*|((3t? + s?) /8t?)? — 2] = s* + 6t?s? — t*/7 = uv? where u is a 
nonzero rational number. 
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Since 7u? is an integer, then u must be an integer, so 7 divides ft, 
and therefore 7 's. Let t = 7°v, with e > 1, 7 /v. Hence 


(7.1) s§+6 x Tyg? — 7A-1yt = uy. 
From (7.1) we deduce that 

(7.2) (824.3 x Tv)? — uw? = 64 x Toy, 

hence 

(7.3) (s7+3 x 7°v? +u)(s? +3 x 7v? — u) = 64 x TA 1". 


Now we show that gcd(s?+3x 7*°v?+u, s?+3 x 77°? —u) is a power 
of 2. Indeed, let p 4 2 be any prime such that p | s?+3x7?¢v?+u and 
p|s?+3x 7*v? —u. Then p | 2u, so p| u. Also p | 2(s?+3 x 77%v?), 
so p| 8? +3 x 7*v?. If p = 7, since e > 1 then 7 = p | s, which is 
a contradiction. If p 4 7, by (7.3) p | 64 x 74*-!v4, so p | v, hence 
p|s, and this is again a contradiction. Thus the greatest common 
divisor of the two factors in (5.3) is a power of 2. 


Subcase (a): v is odd. 

From (5.3) it follows that 
s?4+3x 7ytu= To Aa 
s?+3 x 7%? tu= Bd’, 


(7.4) 


where a,b are relatively prime integers, ab = v (so a,b are odd) and 
A, B are even and AB = 64. 
From (7.4) we deduce that 


(7.5) 252 +6 x Ty? = 74-1 Aa! + Bot. 
Noting that if x is odd then x? = 1 (mod 8) and that 7 = —1 
(mod 8) then 
A B A B 
2 a 2e 2,72 , m4e-1 4 | A 
si =-3 x 7°°a°b* +7 po + Gb S38 Bs (mod 8). 


We consider the various possibilities for A, B: 


A=32, B=2 — s?=-—2 (mod 8), impossible, 
A=16, B=4 — s?=-1 (mod 8), impossible, 
A=8, B=8 — s?=-3 (mod 8), impossible, 
A=4, B=16 — s?=83 (mod 8), impossible, 
A=2, B=32 — s?=4 (mod 8). 
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In this last case, (7.5) becomes 
SAS kT ek HT a + 160; 
hence multiplying with 64, 
64s? + 6 x 77° x 32a7b? — 64 x 16b* = 74°"! x 64a’, 

SO 

643" — (320? — 3 x 7a?) = 7 x 6407 = 78 Xa HT, 
and therefore 
(7.6) (8s + 326? — 3 x 77°a?)(8s — 320? +3 x 77a?) = 7*-1a?4. 


We note that gcd(8s + 32b? — 3 x 77°a?, 8s — 32b? +3 x 77a”) = 1. 
Indeed, since a is odd, the above numbers are odd. If p | 8s +32b? — 
3 x 7a? and p | 8s — 32b? + 3 x 77a? then p | 16s, so p| s, and 
similarly p | 326? — 3 x 7°*a?. But p | 7*°~'a* (by (7.6)). If p = 7 
then p = 7 | s, which is impossible. If p 4 7 then p | a so p| b, again 
a contradiction. This proves the assertion, and therefore 


a 2 2e,2 _ 4 
(7.7) es $3~x 7T*a*=Cc 


8s F 3262 +3 x 7a? = 74-1 ¢4, 


where c,d are relatively prime integers, cd = a. Hence c,d are odd. 
From (7.7) we derive the congruence 


+3=c' (mod 8), 
which is impossible. We have therefore shown that subcase (a) is 
impossible. 


Subcase (b): v is even. 
Then s is odd (because t is even), hence u is also odd. We write 
(5.2) as 


(9? +3 x Ty)? —w=4 x T-1 (20), 


and noting that both factors of (7.3) are even, and their greatest 
common divisor is 2, we deduce that 


2 262 _ 4e—-1 44 
(7.8) eee +u=2 x 7*-1A4, 


St Be our = wa? 6B, 
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with A,B relatively prime integers, 4B = 2v, so either A or B is 
even. Hence 

(7.9) 

82 = —3 x 72€ 9)? + 7te-L At + Bt _ —3 x 72 A? B? He 7ae—1 44 + B*. 

If B is even then A is odd and s? = —3v?—1 (mod 8), hence s? = — 


(mod 4), which is impossible. 
If A is even, then B is odd. Let A = 2A;, hence from (5.9) 


s° = —3 x 7° ATB? + 7%"! x 24 Ai + Bt 


and therefore 1 = s* = —3A{+ 1 (mod 8); hence 3A? = 0 (mod 8) 
so 4| A,. We write A, = 4A2, hence A = 8A, and (7.9) is rewritten 
as 

s? = —3 x 16 x 7° A2B? +71 x 84 x Ab + B’, 


sO 

s*?—(B?—3x 8 x 7°°A3)? = 7° * x 8 x AD — 3? x 8? x TAD 
(7.10) SEB XAG 
This gives 


(7.11) (s—B?+3x8 x 7°A5)(s + B? —3 x 8 x 77°A5) 
SOA (DAs) 


The two factors of (7.11) are even, and it may be seen, as before, 
that their greatest common divisor is 2. Hence 


go Beas 28 TAs — Ie. 
i Bee R Be Aa XR 2a, 


(7.12) 


where Cz, dz are relatively prime integers such that codz = 2Ap. 
From (7.12) we deduce by subtraction that =B? +3 x 8 x 77°A3 = 
c5 — 7*°-1d5, hence 


(7.13) Bj tt6e Meg arr. 


Since any nonzero square modulo 7 is congruent to 1,2 or 4, then 
B? +c} #0 (mod 7). So we must have the negative signs in (7.13): 


(7.14) BP = 6 +6xX 7° de — 7 dh. 


This equation is of the same form as (7.1). Moreover, 


AB 
SS Be = 4A,B > 2cody > dy. 
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TABLE 4. FLT for the exponent 7. 


Author Cases Year 

Legendre (Sophie Germain) first 1823 (see Chapter II, §3) 
Genocchi both 1864 

Pepin both 1876 

Maillet both 1897 

So, we may proceed by infinite descent and conclude that the 


subcase (b) is also impossible. Thus, case II is not possible, and this 


proves 


(1). 


(2) If x,y,z are integers such that x’ + y’ + z” = 0, we consider 
the polynomial f(X) = X? — pX* + qX —r, which has the roots 


L,Y, %. 
LYyz = 


By (1), since x,y,z cannot be proportional to 1,¢,¢?, then 
0. 


Other proofs of Fermat’s theorem for the exponent 7 are given by 
the authors in Table 4. 


1823 


1839 


1839 


1840 


1840 
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1.8. Other Special Cases 


There have been numerous papers devoted to the proof of Fermat’s 
theorem for special exponents, other than 3, 4, 5, 7. The methods 
used were specific to the exponent in question and in most instances 
not susceptible of generalization. 

We note that according to an oral communication of Terjanian, 
Hayashi’s proof (1911) for the second case and exponent 13 has a 
mistake. 

Bréi¢-Kostié studied the equation x*+y? = 2° in 1956 and showed 
that it has no solution in relatively prime integers; however, it has 
nontrivial solutions in integers which are not pairwise relatively prime. 

Now we indicate the elementary proofs of Breusch (1960) for the 
exponents 6 and 10. Of course, the theorem for these exponents 
follows from the truth for the exponents 3 and 5. But the proofs 
which we present will be entirely independent of the above results. 
The following preliminary results, by the method of infinite descent, 
were also proved by Breusch: 
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TABLE 5. FLT for various exponents. 


Author Exponent Case Year 
Kausler 6 both 1806 
Sophie Germain all primes less first 1823 
(Legendre) than 100 (see Chapter IIT) 
Dirichlet 14 both 1832 
Lamé? 11, 17, 23, 29, 41 first 1847 
Matthews 11, 17 first 1885-6 
Tafelmacher* T2329 first 1892 
Thue 6 both 1896 
Tafelmacher 6 both 1897 
Lind 6 both 1909 
Mirimanoff 11, 17 first 1909 
Kapferer 6, 10 both 1913 
Swift 6 both 1914 
Kokott 11 first 1915 
Fell 11, 17, 23 first 1943 
Breusch 6, 10 both 1960 
Terjanian 14 both 1974 


(8A) There exist no positive integers x,y,u,v such that 


(8.1) ety =u? +? 
and 
(8.2) xy = 2uv. 


PRooF. Assume the contrary, and among all possible solutions, con- 
sider the one with minimal positive product zy. From this minimal 
choice it follows that no three of the four integers x, y, u,v can have 
a common factor greater than 1. Also gced(x,y) = 1, because if p is 
any prime dividing x and y then p divides u or v, a contradiction. 
Similarly, gcd(u,v) = 1. Hence one of x,y is even, and the other is 
odd. For example, let 2 | y, so 2? + y” is odd, so u or v is even, 
say u is. Therefore from (8.2) we deduce that 4 | y; from 2? = 1 


3Lamé’s paper (1847) has no proofs. 
4Tafelmacher’s proof (1892) holds only for the first case; see also Dickson, 
History of the Theory of Numbers, Vol. II, pp. 755. 
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(mod 8) it follows that «7+ y? =1 (mod 8). Again, by (8.1), u? = 0 
(mod 8), so u=0 (mod 4) and 8 | y. 
Let y = 2"*1y’ (with r > 2,y' odd), then u = 2"u’, with wu’ odd. 


Thus 
(8.3) ry’ =u'v and x,y’, u'v are odd, 
, gcd(x, y’) = gcd(u'v) = 1. 


Moreover, if 
a= gced(z,u’), b = gcd(z, v), 
e = ged(y', uv’), d= gcd(y’, v), 
then a,b,c,d are pairwise relatively prime and b,d are odd, so b? = 
d? = 1 (mod 8). So x = ab, y’ = cd, uw’ = ac, v = bd. Therefore the 
original equation becomes 
a2b? + Q2r+2 02 2 _— 92° G2 ¢? EE b?d?. 
Letting ¢t = 2”c, we have 
(8.4) 3¢?d? = (a? — d?)(t? — 0), 
where t,a,b,d are pairwise relatively prime and 4 | t. It follows that 
d? divides t? — b? and t? — b? divides 3d?. 
This holds if and only if one of the following conditions is satisfied: 
(1) #? — b? = 3d’; 
(II) t? — b? = d?; 
(III) t? — b? = —3d?; and 
(IV) t? — b? = —d?. 
But t? = 0 (mod 8) and b? = d? = 1 (mod 8). Therefore the cases 


I, I, III are not possible. From t? — b? = —d? it follows that 3t? = 
d? — a?, so 


(8.5) P=d+? =a? + 4’. 


By (1A) there exist integers m,n > 0 and integers r,s > 0 such 
that 


b=m +n’, 
Co) { t = 2mn, 
and 
b=r?+8?, 
(8.7) { 2t = 2rs. 


66 I. Special Cases 


From these, we obtain the relations 


(8.8) rts =m +n 
and 
(8.9) rs = Qin 


Since rs = t = 2"c < 2"y' < y < zy, we have found a new solution 
for the original system, contradicting the minimality of the solution 


(So, Y; U, v). 


(8B) There exist no positive integers x,y,u,v such that 


(8.10) ea-yawtv? 
and 
(8.11) ry = 2uv. 


PROOF. We assume the contrary and consider positive integers sat- 
isfying the above relations and such that xy is minimal. 

Proceeding as before, one of x,y is even and the other is odd. 
Thus x? — y*? = u? + v? = 1 (mod 4); therefore necessarily x is odd 
and y is even. As in the preceding proof, we arrive at equations 
(8.3), (8.4), (8.5), and (8.6) with t = 2’c,r > 2. And we obtain the 
relation a7b? — 2?"+7¢?d? = 2?"q?c* + b?d?, hence 


(8.12) 5t?d? = (a? — d’)(b? — t”). 

So d? divides b? — t? and b? — t? divides 5d?. By the same reasoning 
as previously, taking congruences modulo 8, we see that b? —t? = d?. 
It follows that 5t? = a? — d?, hence 

(8.13) P=aP?+? and b? + 4t? = @?. 


By (1A) there exist integers m,n > 0 and integers r,s > 0 such 
that 


= 2 2 
(8.14) oe TNS 


t = 2mn, 


and 


b=r? —3?, 
oP) { 2t = 2rs. 
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We obtain the relations 


(8.16) risv=m +n? 
and 
(8.17) rs = 2mn. 


Since rs < xy, we have again reached a contradiction to the mini- 
mality of the solution (a, y, u,v). 


Using these facts, Breusch proved: 
(8C) There exist no positive integers x,y,z such that «© +y°® = 2°. 


PROOF. Assume on the contrary that x,y,z are pairwise relatively 
prime positive integers such that «2° + y° = z°. Then 3 does not 
divide both x and y, so we may assume that 3 /x. We have 


a = 2 —y = (zt y(z-y(2 + zyty (2 —zyt+y’). 


The last two factors in the above product must be odd, since x, y are 
not both even. Moreover, it is easily seen that each factor 27+ zy+y? 
and z? — zy+y? is relatively prime to the three other factors of the 
right-hand side (because 3 / x). Hence by the unique factorization 
of integers, 


{ z+ zyty? =D, 
e—zytyae, 


with integers b > c > 0. Adding and subtracting these relations, we 
have 2(z* + y”) = 6° + c® and 2zy = b° —c®. But b,c are odd, so 
be +c? = 2m, & — c? = 2n (with m > 0,n > 0), so squaring and 
adding, we get b©+c® = 2(m?+ 7), and multiplying, b° —c® = 4mn. 
Hence 27+ y? = m?4 n?, zy = 2mn. According to (8A), this is 
impossible. 


(8D) There exist no positive integers x,y,z such that 


gO 4 yi0 — 710. 
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PRooF. Let x,y,z be positive, pairwise relatively prime integers 
such that 21° + y!° = z!°. Then 5 does not divide both x and y, 
so we may assume 5 /x. We have 
gS ey 
= (zt y)(2—y)(Z + ey + 27y? + zy? +") 
x (24 — zy + 27y? — zy? +y’). 


4 


The last two factors in the above product must be odd, since y, z are 
not both even. Moreover, it is easily seen that each of the last two 
factors is relatively prime to the three other factors on the right-hand 
side (since 5 a). Hence by the unique factorization of integers, 


7 — zy om be gry? + zy? — yf — (hae 
2 By t Py? — zy + yt = c%, 


with integers b > c > 0. Adding and subtracting these relations, we 
have 2(24 + 2?y? + yt) = B10 4 cl, 2(z3y + zy?) = b'° — cl. But 
b,c are odd, so 6°? +c? = 2m, b? — c? = 2n (with m > 0,n > 0), so 
squaring and adding, we get b'°+c'° = 2(m?+n?), and multiplying, 
519 — cl? = 4mn. Hence 
4 2y+yt =m? 4+n?2, 
{ zy(z7 + y’?) = 2mn. 


Now letting 27+ y? =r, zy = s, then r? — s? = m?+4+ n?, rs = 2mn. 
This is impossible by (8B). 


Fermat’s theorem for the exponent 14 was established by Dirichlet 
(1832) before Lamé settled the case of exponent 7. Dirichlet also 
showed: 

(8E) The equation 

x" =) yi4 = gma7lin gi4 
(with m > 0,n > 0) has no solution in nonzero integers x,y,z with 
x,y relatively prime. 


Terjanian proved in 1974: 


(8F) Ifa isa natural number, a 4 0, multiple of 7, and without 
prime factor p=1 (mod 7), if x,y,z are natural numbers, x,y # 0, 
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x,y relatively prime, if 


then x =y=1 andz=0. 


From this, he deduced a simple proof of Fermat’s theorem for the 
exponent 14. 

In 1885, Matthews gave a proof for the first case of Fermat’s the- 
orem for the exponents 11 and 17. In 1948, Fell indicated a distinct 
proof for 11 and claimed that his method also solved the first case 
for the exponents 17 and 23. 
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1.9. Appendix 
Would this Be Fermat’s “Marvelous Proof” ?® 


We can see in Euler’s Opera Posthuma Mathematica et Physica, 
Petropoli, 1862, Vol. 1, pp. 231-232, in “Fragmenta arithmetica 
ex adversariis mathematicis deprompta” the following argument, at- 
tributed to A.J. Lexell. It represents an attempt to apply the method 
of infinite descent to Fermat’s equation, in a way which could have 
been Fermat’s. 

The attempt failed. 

Assume that FLT is false for some exponent n > 2. We may 
assume n to be an odd prime (in the argument we only require n 
to be odd) and that there exist nonzero integers (not necessarily 
positive) a,b,c, such that a” + b” = c” and c is even, a and b are 
odd, a # b, and gcd(a, b,c) = 1. Let 


Cie, 

n= (a” = b”) /2, 

2S abe" "12 
sO X,Y, z are integers, © is even. Then 


rty=a", 
xr—-y=b", 


Ag? C2 x , 
It follows that 


x 
= gr ty? = (g-Vi2y)?, 


Let d = ged(z, z), sod = c”~*/2 because gcd(ab,c) = 1. Let x = dz’, 
z = dz’, so x’ is even, gcd(a’, 2’) = 1 and d"t!a'(a'" — 42'") isa 
square, hence so is 2/(a’" — 4z'"), with the two factors relatively 
prime. Thus there exist integers r,s such that 


hence 


>I am indebted to E. Bombieri who called my attention to what follows. 
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So 
rr — 5? = (r" + 8)(r" — 8) = 42". 
Since r = s (mod 2), then gcd(r” + s,r” — s) = 2, so 
r? + 5 = 20?, 
r’™ —§ = 2u”, 
and adding, we obtain r” = t”+u”, which gives a nontrivial solution 


of Fermat’s equation, with r even. If we would have r < c then by 
descent we would reach a contradiction. However, 


so r =c and the descent method is not applicable. 


I] 
A Interludes 


In this chapter we discuss topics which will be required in the sub- 
sequent developments. Their importance is not restricted to their 
applications to Fermat’s last theorem. 


II.1. p-Adic Valuations 


Let p be a prime, let a be a nonzero integer, and let v,(a) be the 
exponent of p in the factorization of a as a product of prime-powers: 


a=p”%b, where p /d, 


vp(a) is the p-adic value of a. By convention we also set v,(0) = co. 
We note: 


up(ab) = vp(a) + vp(d); 
(1.1) vp(a +b) > min{v,(a), Up(d)}; 
if vp(a) < Vp(b1), Vp(b2),--- , Up(dx) 
then vp(a + b) + bg + +--+ dy) = v,(a). 


If v,(a) =e > 1 we say that p® is the exact power of p dividing a 
and we write p*® || a. 
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More generally, if gcd(a,a’) = 1, a’ > 0, we define v, (a/a’) = 
vp(a) — v,(a’). Then for any rational numbers r,s, the above prop- 
erties (1.1) are still satisfied. The mapping v, : Q— ZU {oo} is the 
p-adic valuation of Q. 

We say that r = a/a’ is p-integral if v,(r) > 0. Clearly every 
integer is p-integral (for every prime p). The set Z, of p-integral 
rational numbers is a subring of the field Q of rational numbers. 
Explicitly, r € Z, if and only if r = 0 or r = p*a/b with k > 0, b > 
0, gced(a, b) =; p ya, pb. 

If r,s € Q,e > 1, we write r = s (mod p*) when v,(r — s) > e; 
we also say that p° divides r — s (with respect to Z,). This relation 
of congruence satisfies the ordinary properties of the congruence of 
integers modulo a natural number. 

It is also clear that a rational number r is in Z if and only if r is 
p- integral for every prime p. 

The following two results have numerous applications. For any 
real numbers let [x] denote the unique integer such that [a] <a < 
[x] + 1. [a] is called the integral part of x. 

In 1808, Legendre determined the exact power p™ of the prime p 
that divides a factorial a! (so p™*! does not divide a!). There is a 
very nice expression of m in terms of the p-adic development of a: 


a = agp” + ax_ip** +--+ + ap + ao, 
where p* < a < p*t! and 0 < a; < p—1 (fori =0,1,... ,k). The 
integers ao, @1,... , Qe are the digits of a in base p. 


For example, in base 5, we have 328 = 2 x 5° + 3 x 57+ 3, so the 
digits of 328 in base 5 are 2, 3, 0, 3. Using the above notation: 


(1A) JIfa>1 then v,(a!) =m where 


wa fa] — a— (ao +a, ++++ + ax) 
m=>|5]= pal 


i=l 


PROOF. By definition a! = pb, where p /b. Let a = q ptr; with 
0<m,0< 1" < p; so qm = [a/p|. The multiples of p, not bigger 
than a, are p,2p,... ,qp <a. So p"™(q!) = pv’, where p /'b’. Thus 
g. +m, = m, where p™ is the exact power of p which divides q!. 
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Since q, < a, by induction, 


but 


as may be easily verified. So 


alae 


Now we derive the second expression, involving the p-adic digits 
of a= agp" + -+-+a,;p+ ag. Then 


a 
|] = agp’ +--+ +41, 


a 
| = agp’? +--+ +42, 


>|] = a, + ao(pt 1) +a3(p? +p+1)4+-:: 


+ ax(p** + p*? +---+-p+1) 


——fa(p —1) +.a,(p? —1) +--+ +ax(p* — 1)} 
p 


1 
= pa eee 


In 1852, Kummer used Legendre’s result to determine the exact 
power p™ of p dividing a binomial coefficient 


a+b\ (a+b)! 
Googe ales @ 


wherea>1,b>1. 
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(1B) The exact power of p dividing ae as equal to Eg +ey+-+-+€p, 


which is the number of “carry-overs” when performing the addition 
of a,b written in base p. 


PRooF. Let 
a= a tapt---+ap', 
b= bo +bip+---+ hip’, 


where 0 < a; < p—1,0 < 5; < p—1, and either a, 4 0 or bh} £ 0. 
Let S, = ae a;, Sy = ee b; be the sums of the p-adic digits of 
a,b. Let ¢;,0 <c; <p—1, and ¢; = 0 or 1, be defined successively 
as follows: 


ao + bo Eogp + Co, 
ep tay +b) = eypt+a, 
€y tag + bo = En:p+ ca, 


e-itath = pte. 


Multiplying these equations successively by 1, p, p?, ... and adding 
them, 


atb+eqpt erp? +--+: +e-i1p’ = eopt+ erp? +--+ + ep 
+ co teypt +++ + cp". 


So,atb=ceteqpt:--+cep'+ep'', and this is the expression 
of a+ 6 in the base p. Similarly, by adding those equations, 


Sa + Sp + (Eo +é1++*+ + &e-1) = (Co ter +++ + 2)p + Sore — Et 
By Legendre’s result, 
(p—1)m = (a+b) —Sa4p —€+S,-64+ S, 
= (p—l)(eo+e1+---+&). 


Hence, the result of Kummer. 


This theorem of Kummer was rediscovered by Lucas in 1878. 
In 1991, Frasnay extended the result replacing integers by p-adic 
integers.! 


! This result is apparently still unpublished; a preprint was given to the 
author. 


II.2. Cyclotomic Polynomials C6 


The results of Legendre and Kummer have found many applica- 
tions in the so-called p-adic Analysis. 
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11.2. Cyclotomic Polynomials 


Let n > 1 and ¢, = cos(27/n) + isin(27/n). SoG, = 1, @ = —1, 
C3 = (=1 + iV3)/2, Ga = a, Cs = cos 72° + 7sin We Ge — (1 a iv’3)/2, 
etc.... 

All the powers of ¢, are also nth roots of 1, so they are roots of the 
polynomial X” —1. If w is any nth root of 1, the smallest d > 1 such 
that w? = 1 is called the order of the root of unity w and necessarily 
d divides n; we say then that w is a primitive dth root of gq. 

The powers ¢/ for 7 = 1, 2,...,n are all distinct with ¢” = 1, 
so ¢, is a primitive root of unity of order n. Since there are n nth 
roots of 1, every nth-root of 1 is a power ¢/. Moreover, as is easily 
seen, ¢/ is a primitive nth root of 1 if and only if gcd(j,n) = 1. Thus 
the number of primitive nth roots of 1 is equal to y(n), where y(n) 
denotes the totient of n and y is Euler’s function. 

The nth cyclotomic polynomial is 


(2.1) ®(X)= JT (x-G) 


gcd(j,n)=1 


(product for all 7,1 <7 <n, ged(j,n) = 1). It is a monic polyno- 
mial of degree y(n). Since the polynomial remains invariant by the 
permutation of its roots, from Galois theory the coefficients are in 
Z. 
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By grouping the nth roots of 1 according to their order, we obtain 


(2.2) X"-1=]] %u(X). 
d| 


From this, or directly, we obtain 


(2.3) ®,(X)=XP 1+ XP P+. 4+ X41, 

XP 1 

Xp — J 

— xP '(e-1) 4+ XPo'(P-2) Saas ee Cai oat 


(2.4) ®,(X)= 


for any prime p and e > 1. 
If m|n and m 4 n we have therefore 


(2.5) X"—-1=(X™"—-1)0,(X)][ @a(X) 


(product for all d,1 <d<n,d|n,d/m). Let yu be the Mobius 
function 


_ f Oif the primes p; are not distinct, 
20)s sEDE DE) = { (—1)” otherwise. 


Then 


(2.7) ®,(X) = ][(xr/4 — 1)". 
din 
We note the following properties: If p is a prime and p divides m 
then 
(2.8) ®ym(X) = Op (X?) (when p | m). 
If p does not divide m and s > 1 then 


@,,,(X” ) 
(2.9) Dp2m(X) = &,, (xP) (when Pp \m, Ss a 1). 
We now consider the corresponding homogenized polynomials in 
two indeterminates. Let 


x 
(2.10) ®, (X,Y) =V°Ms,, (>) 
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then 
(2.11) x" —Yy" =|] ,(X,Y). 


dln 
Ifm|n and m¥n then 


(2.12) X”"—Y" =(X™—Y™)®, (X,Y) ]] a(X, Y) 


d 
(product for all d, 1<d<n,d|n,d)/m). We have also 
(d) 
(2.13) &,(X,¥) =[] (xr¢-yrey"™, 
d\n 
and, as before, 


(2.14) Biym(X, VY) = On(X?, Y?) (when p|m), 


D(X? , VP") 


2.15) ®ysm(X,Y) = a =A 
( ) Pp ( ) ®,,, (XP ayn ) 


(when p /m, s > 1). 


II.3. Factors of Binomials 


Let a,b be nonzero distinct integers. In this section we consider 
binomials a” + b” as well as the integers (a” — b")/(a — b) and we 
discuss their factors. 


(3A) Let a,b be nonzero distinct integers. 
(1) Ifp4#2,p/ab, and v,(a— b) =e > 1, then v, (a? — b?) = 
e+r for every r > 1. 
(2) If 2 Yab and v2(a—b) =e > 2, then v2 (a® —b?) =e+r for 
every r > 1. 
(3) If p is any prime and p | a? — b?, then p* | a? — b?. 


PRooF. (1) It suffices to show that v,(a? — b?) = e+ 1, and then 
repeat the argument. By hypothesis, a = b+ kp*, where p /k. Then 


a? = b? + (") bP kp® + () bP? 22° ih ol KP pre, 
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Since p divides (*) for j =1,...,p—1, then 


Us (") nip >1+ Je. 


From v,(k?p?°) = pe, it follows that v,(a? — b?) =e+1. 

(2) As in (1), it suffices to show that v2(a? — b?) =e +1. By 
hypothesis, a = b + 2°k with e > 2 and a,b,k odd. Then a? = 
b? + 2°+1k + 2°°k?; since e+ 1 < 2e, then v2(a? — b?) =e +1. 

(3) By hypothesis, a = a? = b? = b (mod p); raising to the pth 
power, a? = b? (mod p?), so p? | a? — bP. 


If n > 1 and a,b are distinct nonzero integers, let 


ge aph 


oy akon ik, 


a—b — 


(3.1) 0, (0,0) = 


By convention we define Q (a,b) = 0. We note the following expres- 
sion for Q,,(a,b) (n > 1): 
[((a — b) + B]” — b” 

a—b 


(3.2) OQ (aeb) = 


+( . Jee — b)b"-? + npr} 
n—2 


= (a—b)e+nb™, 
where e € Z. Also, if n = p is a prime number, then 


(3.3) Q,(a,b) = (a — bP" + pf, 


where f € Z. 

We shall now indicate some properties of the integers Q,,(a, b). 
Jacquemet (before 1729) proved (5) below when n = p is an odd 
prime. Euler proved in 1738: if p is an odd prime, a > 1, then 
gcd (Q,(a,+1),a +1) =1 or p; if p divides a+1 then v,(Q,(a, £1)) 
= 1; moreover, if p # 3, a # 2, then Q,(a,+1) is odd and greater 
than 1. 

In 1769 Lagrange proved (4) below, as well as (6) when n = pisa 
prime number. In 1837, Kummer proved (4) when n = p is a prime 
number. In 1888, Sylvester proved a special case of (3). In 1897, F. 
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Lucas proved special cases of (5) and (6). More recently, properties 
(4), (5), (6), and (7) have been established by Inkeri (1946) and 
Vivanti (1947). In the present form, most of the proposition is in 
Moller’s paper (1955). 


(3B) Letn > m > 1, and let a,b be nonzero distinct integers. 
Then 


(1) If n=mq-+r with r > 0 then 


Qn(a, b) = a"Q,(a™, Bb”) Qn (a, b) + 6""Q,(a, 6). 
Ifn=mq—-—r with r > 0 then 


Q,(a, b) = a" Cala’ b”) + pro | Qm(a, b) 
= GO EO as b). 


Assuming a,b relatively prime, we have: 


(2) Ifd= gcd(n, m) then Qala, b) = ged (Q,(4, b), Qm(a, b)). 
3) [pin Qp(a, 6) divides Q,,(a, 6). 
) gcd(Q,,(a, b), a— b) = gcd(n, a— b). 
) If p|a—b, p/n then p /Q,,(a, ). 
) If p is an odd prime dividing a—b then Vy (Q,,(a, b)) = vp(n). 
) If 4|a— 6 then v2(Qnr(a, b)) = v2(n). 
If 2|a—b but 4 Va—b then v2 (Qn(a, b)) > ve(n). 
(8) If n is odd then Q,(a,b) is odd. 
(9) If n is odd and e > 0 then gcd(Q,(a, b), a2" + b?'") =1. 
(10) If every prime factor of n divides a—b then n(a— b) divides 
a” — Bb”. 


( 
(4 
(5 
(6 
(7 


Proor. (1) Let n=mq+r. Then 


qmatr = pmatr 


Qn(a,b) = Pn 
Gn Aen tobe 
7 a—b 
re SO a SO 2) 
=a’ x = rae x om 
saying eR OS Oe eae 
am — pm a—b a—b 


a’Q,(a™, 6”) Qm(a, b) + Q-(a, b)b™. 
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Now let n = mq —r. Then 


fare aka”, b”) + pee Qm(a, b) _ a?" "O(a: b) 


aE aU fois 
=a 
q™ bm a-— b a— b 
a” —b" 
—g™-TEr-™ 
a—b 
7 qa" — am" plq-)m + agm™pr-™ mae b” —- a™pr-™ + gmt pn-mtr 
= a—b 
qa" — br 
= = Q,,(a, b). 
a— b Q ( o) ) 


(2) Since a, bare relatively prime then for every k > 1, a and b are 
also relatively prime to Q;(a,b) = a*-!+a*-?b+---+ ab*-? + dF}. 
By (1), Qa(a, 6) divides Q,,(a,b) and Q,,(a,b). Let r,s be positive 
integers such that d = sm —rn (or d=rn—sm). So sm =rn+d 
and by (1), 


Q.(a”™, b” )Qm (a, b) — Qem(a, b) — a“Q,(a”, b")Qn(a, b)+b"" Qala, b). 


If t divides Q,,(a,b) and Q,,(a,b) then t divides 6'"Qa(a,b); but 
Q,(a,b) and 6 are relatively prime, hence ¢t | Qa(a,b), showing the 
statement. 

(3) By (2), the integers Q,(a,b) (for primes p dividing n) are 
pairwise relatively prime. By (1) if p|n then Q,(a, b) divides Q,, (a, b). 
Hence JJ, Qp(a, 6) divides Q,(a, 6). 

(4) Since gcd(a, b) = 1, it follows from (3.2) that gcd(Q,,(a, 6), a— 
b) = gcd(n, a — Db). 

(5) This is an obvious consequence of (4). 

(6) Letn=p'm,plm,r >0s0 v,(n) =r. Let a) = a™, b) = 
b”™. Since p | a— 6b, by (5) p does not divide Q,,(a,b) = (a™ — 
b™)/(a—b) = (a, —b,)/(a—}), so v, (a; — 61) = v,(a— 6) > 1. Hence 
vay” _ bt”) = v,(a, — b,) +r. Thus 


Up (Qn(a, b)) = (LaF) + »(“ — ) =7 = 050) 


a, — by 


(7) Let n = 2m, 2 /m, r > 0, so ve(n) = r. Let ay = a™, db) = 
b™. As in (6), ve(ai — b1) = vo(a-—b) =e > 1. Ife > 2 then 
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However, if e = 1 then v2 (a7 — b?') > r+ 1 (and it may be greater 
than r+ 1), so we can only conclude that v2 (Qn(a,b)) > ve(n). 

(8) Ifa#b (mod 2) then a” 4 b” (mod 2) so Qn(a,b) is odd. 
If a = b (mod 2) since a,b are relatively prime, they are odd. It 
follows that Q,(a,b) = a""'+a"*b+---+ab"-? +6" is the sum 
of an odd number of odd summands, so it is odd. 

(9) Let p bea prime, r > 1, and p" | Q,,(a,b), p” | a "+b?”. By 
(7), p #2. Since p” | a” — 6" then a?” = 6?" (mod p’) so p” | 2a?” 
Therefore p divides a, hence also b, which is not possible. 

(10) Let p be any prime factor of n; by hypothesis p | a — 8, 
hence by (6) or (7), vp(n) < vp (Qn(a, b)), therefore v,(n(a — b)) < 
Up(Qn(a, b)(a — b)) = v,(a” — 6"). Since p is arbitrary, this shows 
that n(a — b) divides a” — b”. 


We indicate now the following complementary result proved by 
Inkeri in 1946: 


(3C) Let p be an odd prime, n > 1, and let a,b be nonzero 


relatively prime integers such meat a#b. Then: 


(1) Qp»(a, b) = Ts 1 Qp(aP™ > eo 
(2) If p does not divide ar mae then the integers a—b, Q,(a, b), 


Q,(a?,b?),...,Qp(a?, pe" *) are pairwise relatively prime. 
(3) If p | a?” — bv", if i,j are integers such that1 <i<j<n 
then 


ged (2, (a, B") Q, (ar) = p, 
ocd (a =O; (a” ae pr’) <p, 


(4) If vp (a?" — bP") =e>1 thene >n+1 and v,(a—b) = e—n. 


PRooF. (1) 


er = aT] Q,(aP be). 


m=1 m=1 
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(2) We have a?” — bP” = qoqi-+- dn where qq =a—b, 


a?’ — bP 


j-1 pj-1 
qj = gpl! — pp} = Qp (a” bP ) ; 


By hypothesis, p / q; (for j = 0,1,...,n). If J is a prime and | 
divides qi,q; (with 0 <i<j <n), then! 4 pand/| a? —b?’. Since 


up Jj 
pi pi aP = bP 
Qpi-i a? 0 = a — bp — pp? = Gi414i42°°° Uj, 


then | divides Q,;-1(a” , b? ). By (3B)(4), we have I = p, a contra- 
diction. 

(3) Assume that / is a prime, e > 1, and /* divides gq; and 
q; (0 < i <j <n). Then [® divides a?’ — b”’ and also a?’ — 
bP * = (a? — BP) qin: -qj-1. Since I* divides Q,(a”’, bP) = dij; 
by (3B)(4) l* divides p, so I° = p. 

(4) If p divides a?” — 0?" then p Ja, p /b. From 


n—-1 


a= (ae) = (we *)" = b?"" (mod p), 


n-1 1 


it follows that p | a?” ’ — 6?" '. By (3B)(6), vp(Q,(a?" ', B®" *)) 
= 1, hence v,(a?" ' — b®""') =e—1, with e—1> 1. Repeating this 
argument, v,(a— b) =e—nwithe >n+1. 


We shall need later the following estimates: 
(3D) Let n be odd, n > 3, and let a,b be nonzero distinct integers 


such thata+b>1. Then Q,(a,b) >n. The equality holds exactly 
when n = 3,a=2,b=-1 orn=3,a=-1, b=2. 


PROOF. Since Q,,(a,b) = Q,,(b,a) and a 4 b, we may assume with- 
out loss of generality that a > b. If b> 1 then 


Qn(a,b) =a™*+a"7b+---+ab™? 4671 >1414---+1l=n. 
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Ifb<—1lthena>1—6> 2, b? >1 and 


Qn(a,b) = (a+ b)(a*? +a" *b? +--+ ab™*) + 0? 
> QUA 2 sh ONS ee Leet 


= (40-97 4 Alr8)/2 4g 1) 43) 


4(n-1)/2 Baa 
=o p1=2(2"*-1)41 


— >n 


when n > 3. If we have Q,,(a, b) = n then we must have equalities all 
through, and this requires that n = 3, a = 2, b= —1. By symmetry, 
we may have Q,,(a, b) = n also when n = 3, a= —1,b=2 


We note explicitly that if p is an odd prime, if a > b > 1 then 
Q,(a,b) > p. This fact is a special case of the above result, but may 
also be seen directly, since Q,(a,b) = a?~' + a?-*b4+---+ abP-? 4 
ppl > p. 

Before proceeding, we note: If 1 < b < a, by (2.13), ®©,(a,b) > 0. 
If n is odd, n > 3, and a,b are not both even, then ®,,(a,b) is 
odd. Also, if n is odd, n > 3, and a,b are not both even, then 
®,,,(a, b) = ®,,(—a, b) is odd. 

Let a,b be nonzero distinct relatively prime integers, let n > 1. 
We say that the prime p is a primitive factor of a" — b” (respectively, 
a” + b”) if p divides a” — b” (respectively, a” + 6”) but p does not 
divide a” — b™ (respectively, a” + 6”) for every m,1<m<n. We 
first note that if 2 is a primitive factor of a” +b” then n = 1, because 
a = b (mod 2). 

Next we observe: 


(3E) Ifn>2 then p is a primitive factor of a” + b” if and only 
if p is a primitive factor of a?” — b?”. 


ProoF. If p is a primitive factor of a?” — b?" then p / a” — b” so 
p|a" +b"; moreover if p | a* +b* where 1 < k < n then p | a?* —b?*, 
so 2k = 2n, showing that p is a primitive factor of a” + b”. 
Conversely, if p is a primitive factor of a” + 6” (with n > 2) then 
p#2and p| a?” — bo". If1<k < 2nand p| a* — B*, let k = 2°m, 
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with e > 1 and m odd. Then 
a’ =pF = (a'™ - ee) Came + pe) : 


Since 2°-'m <n, by hypothesis p / a2 '™ + 6? '™, so p | a2 '™ — 
b?"™, Repeating this argument, p divides a™ — 6”, where m is 
odd. Thus a” = 6” (mod p) and also a” = —b" (mod p). Then 
a™”™ = b"" (mod p), a” = —b"™" (mod p) hence p | b (because 
p # 2), so p | a, contrary to the hypothesis. Thus p is a primitive 
factor of a?” — b?”. 


The first result is very easy to prove! 


(3F) Let a,b be nonzero relatively prime integers, let n > 1, and 
let p be a prime. The following statements are equivalent: 

(1) p is a primitive factor of a” — b”. 

(2) p| a” — b”, but if 1 <m<n,m|n, then p a” — b”. 

(3) p| ®, (a,b), but if 1<m<n, then p /®,,(a,b). 

(4) p| ®,(a,6), but ifl1<m<n,m|n, then p /®,,(a, 6). 

(5) p |b and if b’ is such that bb! = 1 (mod p) then the order of 

ab’! modulo p is equal to n. 


Proor. The implications (1) — (2) and (3) — (4) are trivial. The 
equivalences (1) < (3) and (2) < (4) follow at once from the ex- 
pression a" — 6” = [],, ®a(a, b). 

Noting that (ab’)4 = 1 (mod p) holds if and only if p | a? — 6%, 
then (5) is obviously equivalent to (1) and to (2). 


Let p be a prime not dividing n. Then the congruence X” —1 = 0 
(mod p) does not have double roots, since the derivative nX”~' has 
only the root 0 modulo p. It follows from (2.5) that if a is an integer, 
a #1, and p/a if ®,_;(a) = D (mod p) then ©,,(a) # 0 (mod p) 
for every m, 1 <m< p—1, m dividing p — 1. 

From this observation and from (3F) we deduce: If a £ 1 and p 
does not divide a, then a is a primitive root modulo p (i.e., the order 
of a modulo p is p — 1) if and only if ®,_,;(a) =0 (mod p). 

The following proposition appears in the paper of Birkhoff and 
Vandiver (1904) and once more in Inkeri’s paper (1946) for the case 
where n is an odd prime. The inclusion E, C FE», was first shown by 
Legendre (1830). 
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(3G) Letn>2. With the preceding hypothesis, the following sets 
of primes coincide: 
E,: the set of primitive factors of a” — b”; 
E,: the set of primes p such that p= 1 (mod n) and p | ®,,(a, b); 
and 
E3: the set of primes p such that p/n and p | ®,(a, b). 


PROOF. We show that E, C Ey. By hypothesis, p | a” — b” but 
py a™ — b™ for all m,1 < m <n. By (8F), if 6b’ = 1 (mod p) 
then the order of ab! modulo p is n. Hence n | p— 1. Moreover, if 
1<m<n,m|n, then ®,,(a,b) divides a” — b™; since p /a™ — b™ 
then p |) ®,,,(a,b). But p divides a” — b” = J],,,,,, ®m(a, 6), hence p 
divides ®,,(a, b). 

Obviously Ey C E3, because if n | p—1 then n < p, hence p Jn. 

Now we show that £3 C Fy. Clearly p | a” — 6”. Suppose p is not 
a primitive factor of a”—b”. Then there exists m, 1<m<n,m|n, 
such that p | a” — 6”. From 


a” — b” = ©,,(a, b)(a™ — 6”) II ®,(a, b), 
d£én,d|n,d){m 


by hypothesis p divides (a” —b")/(a™ —b™). We write n = md, a™ = 
a,, 6” = b,, hence 


min 


q" — bn at > bt d-1 a 
Sap = Sp = Lat; = da" (mod p). 


i=0 


But p | a,—b;, so p /'a;, hence p | d, so p | n, which is a contradiction, 
concluding the proof. 


Next we prove: 


(3H) Let a,b be nonzero distinct relatively prime integers, and 
n> 1. Let p be a primitive factor of at — b*, let v,(a4 — b*) =r >1 
and assume that r > 2 ifp=2. Then 

(1) u,(@a(a, b)) =r; 

(2) ift > 1 then vp (®ay(a, b)) = 1; and 

(3) ift>0,k >1, pk then v,(®xap+(a, b)) = 0. 


Proor. (1) By (3F), p /®,(a,}) for all 1, 1 <1 <d. It follows that 
Up(Pa(a, b)) = vp(a* — b). 
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(2) We have, by (2.12), 


t—1 t—1 


a? — bh” = ®y. (a,b) - (a”” — >? 


II ®., (a, b). 


eld, 1<e<d 


We note that p /®.,+(a,b) when e < d, otherwise p | a’ — b°’; since 
p is a primitive factor of a? — b* then by (3F)(5), d divides ep’; but 
p dso d|e, a contradiction. But by (3B)(6) and (3B)(7), 


vp (a = o') —r+t, 


v, (ae — pe) i Maat aaa 
Therefore v,(®ap+(a, 6)) = 1. 
(3) We have 
atte’ _ pake’ — @4,2(a,b) I ®,(a,b) + (at = pi") 


eldkpt,eJ dpt, e<dkpt 


By (3B)(6) and (3B)(7), vp(a%*" — b%?") =r +t, v,(a® — b#) = 
r+t, hence v,(®axpt(a, b)) = 0. 


For every integer n > 2, let P[n] denote the largest prime factor 
of n. 


(31) Leta>b>1, let gcd(a,b) = 1, and letn > 2. Let p be a 
primitive factor of af — b! such that p| ®,(a,b). Then: 

(1) There exists 7 > 0 such thatn = fp’ with p/ f. 

(2) If 7 >0, then p= P[n]. 

(3) If 7 >0 and p* | ®,(a,b), thenn = p= 2. 

(4) gcd(®,,(a, b),n) =1 or Pin]. 


Proor. (1) By (2.11), ®,,(a,b) divides a” — 6"; then p | a” — b”, 
hence f | n by (3F). Since p| a?~' — b?-', again f | p—1, so f <p. 
Let n = fp’w with j > 0, p / fw. Write r= fp’. By (3.2), 
a” — 6” 
a” — br 


= wb"! (mod a” — 5"). 


Since p | a’ — b” (because f |r), then 
a” — b” 


oR = wb”! (mod p). 
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If n < m then by (2.12), 
pb, (because gcd(a, b) 
n= fp’. 

(2) From f < p, if 7 > 0, then p= P[n]. 

(3) Let 7 >1 and s= fp’—!, son = ps. Then 


®,,(a, b) divides (a" — b”)/(a" — b”). Since 
= 1), then p | w, which is absurd. So 


a” — bo” 7 [(a® — b°) + bs]? — b*? 
a’ — bs = a® — bs 


—_ posP-)) 4 o (aS uf b°)bs-?) 4+ () (as = bP) 77-9) 


+.+++(a* — 6°)? 7, 


If p > 3, since p | a* — bd’, then 


a” — b” 
as — b§ 


On the other hand, by (2.12), ®,,(a,b) divides (a” — b")/(a* — b*), 
hence p? | ®,,(a,b). Thus, if p? | ®,,(a,b), then necessarily p = 2. So 
f <p-—1implies f = 1 and n = 2’, so ®,,(a,b) #0 (mod r), which 
is absurd. This shows that 7 = 1 and n = 2. 

(4) Assume that there exists a prime p dividing gcd(®,,(a, b), n). 
By (1) and (2), p = Pin]. By (3), if p? | gcd(®,(a,b),n), then 
n = p= 2, so p* /n. This shows the assertion. 


=p (mod p’). 


The following very interesting theorem was proved by Bang (1886) 
in a particular case. In 1892, Zsigmondy proved the stronger ver- 
sion presented here. It was rediscovered by Birkhoff and Vandiver 
(1904) and by various other mathematicians, like Dickson (1905), 
Carmichael (1913), Kanold (1950), Artin (1955), Hering (1974), Lii- 
neburg (1981) and maybe others. 


(3J) Leta>b>1, gced(a,b)=1,n>1. 
(1) a” — 6b” has a primitive factor, with the following exceptions: 
(a) n=1,a—-b=1; 
(b) n= 2,a+b a power of 2; and 
(e) n= 6, 2=220= 1, 
(2) a" +b” has a primitive factor, with the following exception: 
n=3,a=2,b=1. 


90 II. 4 Interludes 


ProoFr. (1) It is clear that in cases (a), (b), (c), a” — b” does not 
have a primitive factor. If n = 1 and a—b does not have a primitive 
factor, then a—b=1. 

Let n = 2 and assume that a?—b? does not have a primitive factor. 
From a? — b* = (a+ b)(a — b) and ged(a + b, a— b) = 1 or 2, if p 
is an odd prime dividing a + b, then p divides a? — b?. But p is not 
a primitive factor, so p | a — b, hence p divides a and b, which is 
absurd. This shows that a+ 0 is a power of 2. 

Now let n > 3 and assume again that a” — b” does not have a 
primitive factor. Let p = P[n] and v,(®,,(a,b)) = 7 > 0. Define 


®,,(a, 
@: (a,b) = POP). 
P 
(1°) Assume that ®*(a,b) = 1. Let G1, 2, --- Gyn) be the primi- 
tive nth roots of 1. From 
a a 
— GO) = |r —- Gi lh) Se 
|a — ¢;,b| ; ¢ >0(¢ ) a—b 
and a previous remark, 
p(n) 


®,,(a,b) = |®,(a,6)| = [] |a— Gb] > (a— 6)? > 1 = OF (a,b). 
i=l 
So j >1 and p| ®,,(a,b), hence p divides a” — b”; so p is a primitive 
factor of af —b!, where f divides n. By (31), gcd(n, ®(a,b)) = p and 
also p? /®,,(a, 6). 

In conclusion, ®,,(a,b) = p, because ®*(a,b) = 1. Moreover, from 
p|n, it follows that p—1 divides y(n). This implies in turn that 
p = ®,(a,b) > (a—b)?™ > (a—b)?-}, hence a—b=1. 

If p? | n let n = pm, then p—1 < y(m) and by (2.14) 

p= ®, (a,b) = ®,,(a? — b?) > (a? — pry etm) > (a? — oP)P-t, 
because p | m. Thus a? — b? = 1, which is not compatible with 
a—b=1. 

Thus, from (31), n = pf, p ) f, where p is a primitive factor of 
af — bf. Note also that f | p—1, so f < p. From y(n) = (p— 1)y(f) 
it follows that 
p(a®—b”) > p(al—b*) > ®,,(a,b)®5 (a,b) = By(a?—b”) > (aP—DP) PP), 


using (2.12). Therefore p > (a? — b?)?f)-!, hence necessarily y(f) = 
1, thus f =1 or f =2,son=por n= 2p. 
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If n = p, then p = ©, (a, b) = a? -* + a? -7b+---+ ab? 2+ bP 1 = 
(a? — b?)/(a — b) = a? — BP, and this is absurd because a — b = 1. If 
n = 2p, from 3 < p it follows from (2.12) that 


aP + bP 
atb- 


p= ®>,(a, b) = 


By (3D), necessarily a = 2, b= 1, and p= 3, son=6. 

(2°) Assume that a” — 6” does not have a primitive factor. If 
suffices to show that ®*(a,b) = 1 and the result follows from (1°). 
Let p be a prime dividing ®,,(a,b), so p | a” — b”. Then there exists 
f, dividing n, 1 < f <n, such that p is a primitive factor of af — b/. 
By (31), p = P[n] and ©,,(a,b) = p’ with j > 1. Hence ®*(a,b) = 1. 

(2) Ifn=3,a=2, b=1, then a” +b” = 2°+1 has no primitive 
factor. Conversely, ifn = 1 and a+ 6 > 2, so there is a primitive 
factor. 

If n = 2 and a?+0? does not have a primitive factor, then a?+b? = 
2* (with k > 2). Indeed, if p is an odd prime dividing a? + b?, then 
p|a+b, so p| a? —0?, hence p | 2a?; it follows that p | a and also 
p | b, which is absurd. From a? + b? = 2" (k > 2), gcd(a,b) = 1, 
it follows that a,b are odd, hence a? + b? = 2 (mod 4), which is a 
contradiction, proving that a? + b? has a primitive factor. 

If n > 3, it follows from (1) that a?" — b?” has a primitive factor 
p with the only exception n = 3, a= 2, b= 1. If p= 2 then a,b are 
odd, so 2 | a+ 6, which is not compatible with 2 being a primitive 
factor of a” — 6”. 

By (3E), a” + 6” has a primitive factor, with the exception indi- 
cated. 


It follows from this theorem and (3F) that if a > 2, then each 
number in the sequence 


@3(a), ®,(a), ®;(a), ®,(a), ®,(a), aay 


(with ®¢(a) deleted when a = 2) has a prime factor which is not a 
factor of any of the preceding numbers. 
The following results are also of interest: 


(3K) Let l1<m<n, anda>b>1, with gcd(a,b) = 1. If 
gcd(®,,,(a, b), 
®,,(a,b)) £1, then P[n| = gcd(®,,(a, b), ®,,(a, b)). 
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Proor. If n = 2, then m = 1. If gcd(a — 6,a +b) ¥ 1, then 
gcd(a — b, a+b) =2. 

Now assume n > 3. Let p be a prime and let e > 1 be such 
that p® | ®,,(a,b), p® | ®,(a,b). Then p | a” — b™, p | a” — 0", 
so p is not a primitive factor of a” — b". By (3G) p | n and 
by (31), p = P[nj, ®,(a,b) = pe,p/c, soe = 1. Since p was 
an arbitrary common divisor of ®,,(a,b), ®,,(a,b), this proves that 
P[n] = gcd(®,,(a, b), ®,,(a, b)). 


(3L) Let p be any prime, let 0 <i < jy, and leta > b> 
1, gcd(a,b) = 1. Then 


ged(y (a,b), ®y(a,8)) =f 5 YP ta 
Proor. By (3K), if d= gcd(®,:(a, b), ®,;(a,b)) 4 1 then d= p. 
Assume first that p #2. If p|a—b, then a” =a=b=b? 
(mod p) so by (3B), p divides ©, (a,b) = (a”’ — b”’)/(a””* — bP”). 
Similarly, p divides ®,:(a,b). Finally, if p /a — b, then iw Sam 
b = bv” (mod p), so pa” — b”’ and a fortiori p / ®,s (a,b). Thus, 
gcd(®,: (a,b), ®,i (a, b)) = 1. 
If p = 2, then ®,(a,b) = a—b and ®(a,b) = a? +b? (for 
k > 1). So if a= b (mod 2), then 2 divides gcd(®z: (a, b), Bai (a, b)), 
and conversely. 


The following corollary of (3L) will be useful: 


(3M) Ifa>b>1 are integers and n > 2 then Pla” —b"| >n 
and Pla” + b"] > 2n. 


PrRooF. We may assume without loss of generality that gcd(a, b) 
= 1. Indeed, let d = gcd(a,b) and let a = da,, b = db;, so a, > 
b; > 1 and ged(a;, b,) = 1. Moreover, a” + b” = d"(a} + b7), hence 
Pla? + b}] < Pla” + 6"); so it suffices to show that n < Pla} — bi] 
and 2n < Pla? + bi]. Thus, we assume gcd(a, b) = 1. 

(1) Te=2-b= 1 = 6 thema™—b" = 2°—1 = 63 = 3% 7 and 
P{2° — 1] = 7 > 6. In the other cases, by (3L) let p be a primitive 
factor of a” — 6”. By (3G), p = 1 (mod n) so p = 1+ kn, hence 
Pla” —b") >p>n. 


(2) 
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By (3L), let p be a primitive factor of a?” — b?”. By (3G), 


p = 1 (mod 2n), so p = 1+ 2kn > 2n. By a previous remark p is 


also a primitive factor of a” + b”. Hence Pla” + b"| > p> 2n. 


1729 


1738 


1769 


1830 


1837 
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II.4. The Resultant and Discriminant of Polynomials 


Let 


F(X,Y) = apX” +a,X" VY 4---+a,Y" (with ao £0), 
G(X, Y) = Wy X™ +b, X" OY +---+bnY™ (with by #0), 


where the coefficients a;, 6; belong to an integral domain A. 

We shall define the resultant of F,G, denoted by R(F,G) or also 
by Res(F,G). First, if m = 0 we define R(F, bo) = bj, while if n = 0 
we define R(ao, G) = aj’. In particular, R(ao, bo) = 1. Next, ifm 4 0 
and n # 0 we define R(f,g) to be the determinant of the following 
matrix with m+n rows and columns: 


ao ay ORS eee An 0 0 

0) ao ay eee eee An 0) 

0) 0) eee ao ay eee An . 

OS 20! $45. <A. Bye eet he Ne 

O° be. Bio ee ban oe. 20 

bh sbi Se OPS, Soe BHD 
note that there are m rows containing dp,... ,@, as entries, followed 
by n rows with bo,... , Om. 


R(F,G) is called the resultant of F(X,Y) and G(X, Y). R(F,G) 
is a polynomial with coefficients in Z of degree m in the coefficients 
a, and of degree n in the coefficients b;. 

The resultant of OF (X,Y)/OX, OF (X,Y)/OY is called the dis- 
criminant of F(X,Y): 

; OF OF 
Discr(F’) =—R (> +) . 

We recall the following well-known properties (see Bécher (1907) 

or Cohn (1974)): 


(4A) Let F(X,Y), G(X,Y) be binary forms of degrees, respec- 
tively, n >1,m->1. Then: 


(1) F(X,Y) has a (nonconstant) factor proportional to a factor 
of G(X, Y) if and only if R(F,G) = 0. 
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(2) Ifn > 2 then F(X,Y) has a multiple linear factor if and only 
if Discr(F’) = 0. 

(3) IF F(X, Y) = Ty (a X —a,¥) and G(X, ¥) = 11", (8X - 
B;Y) (with a;, ai’, B;, B;' elements of a field containing the 
coefficients of F,G and a;! # 0 for each i and 8,’ # 0 for 
each j) then 


n 


R(F.G) = [[ (an ov’) = (-1)"" [] F(8),8/) 


= [](Gi'o — Bja,') = (-1)™ | Lai’; = Gey) 


In particular, R(G, F) = (-1)"" R(F,G). 

(4) If H(X,Y) is also a binary form of degree l, then R(FG, H) 
= R(F,H)R(G, H) and R(H, FG) = R(H, F)R(H,C). 

(5) Ifn >m and H(X,Y) is a form of degree deg(H) =n —™m, 
then R(F — HG,G) = R(F,G). Similarly, if m > n and 
K(X,Y) is a form of degree deg(K) = m—n then R(F,G — 
KF) = R(F,G). 


Now let f(X),g(X) be any nonzero polynomials of degrees n,m, 
respectively. Let F(X,Y) =Y"f (X/Y), G(X,Y) =Y™g(X/Y) so 
F (X,Y), G(X,Y) are binary forms of degrees n,m, respectively. 

The resultant of f,g is by definition R(f,g) = R(F,G). The dis- 
criminant of f is, by definition: Discr(f) = R(f, f’). 

Ifn >1, m>1 and F(X,Y) = J], (aX — aiY), G(X,Y) = 
ITji1 (6; X — 6;Y) (as in (1A)), with a;’ 4 0, 6;’ 4 0, then a;/a,' 
are the roots of f(X), 3;/G;' are the roots of g(X). 

If f(X) = aX" + a,X" 1 +--+ + an (with n > 1, ao 4 0), let 
(X,Y) =¥"f (X/Y) = TI (aX — Y). 

The discriminants of the polynomial f(X) and of the binary form 
F'(X,Y) are related as follows: 


(4.1) Discr(F’) = — Discr(f). 


Indeed, the derivative of f(X) is 
f'(X) = nag X" 1" + (n —1)ayX" 7 + + ay 


the corresponding binary form is G(X,Y) = Y"'f’(X/Y). On 
the other hand, 0F/OX = Y"'f'(X/Y) = G(X,Y), OF /0Y = 
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nY" 1 f(X/Y) —- XY"? f! (X/Y) = (1/Y) [nF (X,Y) — XG(X,Y)]. 
So, we have, on the one hand 


R (= y=) =H (=) R (= =) = nao Discr(F). 


OX’ OY Ox OX’ OY 
On the other hand, 
OF _OF 
R (YS) = R(G,nF — XG) = R(G,nF) 


= R(G,n) - R(G, F) = n™1(-1)"° YR(F,G) 
= n2(-1) UR (F, f) = n™"Diser(f). 


For the convenience of the reader we write explicitly the prop- 
erties of the resultant and the discriminant for polynomials in one 
indeterminate. 


(4B) For polynomials f,g,h,k, with deg(f) =n, deg(g) =m: 
(2) Ifn<m and deg(h) < m—n then R(f,g) = R(f, g + fh). 
(3) R(hk,g) = R(h, g)- R(k, 9), 

Rig, hk) = R(g,h) + R(g,k). 

(4) R(f* 19) = [R(f,g)|* for every integer s > 1. 

(5) R((X — a)*,g) = [g(a)|* whereae A, s > 1. 

(6) If f = ao], (X — 00) and g = byl 4(X — B) then 


R(f,9) = ag'bo TT T](a— 85) 
= of TL o(as 
= (yop TL 18). 


(7) If f = aoTTiis(X — a4), then 
Discr(f) = (— 1) n(n— 1)/2q ae ‘TI —— a)? ; 


(8) If f =hk, deg(h) =r, and deg(k) = s, then 
Discr(f) = (—1)"*Discr(h)Discr(k)[R(h, k)]?. 
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(4C) 
(1) If f,g € A[X] are nonconstant and R(f,g) #0, then f,g are 
relatively prime. 
(2) IfA=K is a field, and if f,g © K[X] are relatively prime, 
then R(f,g) #0. 
Proor. (1) Assume that f,g have a common nonconstant factor 
he A[X]. So f =hf, and g = hg. By (4B)(3), 
AC, 9) = R(h, h) . R(h, 91) ‘ Rt h) : R(fi, 91) = 0. 


(2) Assume that f,g € K[X] are relatively prime. By Bézout’s 
theorem, there exist fi,g: € K[X] such that gif + fig = 1; in 
particular, deg(gif) = deg(fig). By (4B)(3), 


R(af, fig) = Rg, fi) Rlgi.g) RUS, A) RU, 9). 
If R(f,g) = 0, then R(gi f, fig) = 0. However, by (4B)(2), 
Rigif, fig) = Rigif, i= of) = Riaif, 1) = 1, 


which is a contradiction. 
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II] 
Algebraic Restrictions 


on Hypothetical Solutions 


Assume that n > 3 and x,y,z are nonzero pairwise relatively prime 
integers such that 


ge” + y” — lt 
In this chapter we derive algebraic relations which must be satisfied 
by x,y,z,n. In some cases, these lead to a contradiction showing 


that for the exponent in question, Fermat’s equation has only the 
trivial solution. 


IlI.1. The Relations of Barlow 


Let p be an odd prime and suppose that there exist nonzero pairwise 
relatively prime integers x,y,z such that x? + y? + z? = 0. To 
begin, we observe that «+ y+ 24 0. Indeed, x,y,z cannot be 
all positive (nor all negative), so we assume, for example, that « > 
0, y >0Oand z <0. Then (x+y)? > 2? +y? = —z? since x+y > —2, 
thus z+y+z2z#0. 

We shall indicate relations which the integers x, y, z must satisfy. 

The first such results were proved by Barlow (1810, 1811) and 
discovered independently by Abel in 1823, who stated them with- 
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out proof, in a letter to Holmboe. The results below were given 
with complete proofs by Legendre, as early as 1823, and were known 
to Sophie Germain. Later, they were rediscovered by Lindemann 
(1901, 1907) and appeared in papers by Catalan (1886), Tafelmacher 
(1892), Fleck (1909), Lind (1910), Bachmann (1919), James (1938), 
Raclis (1944), etc. 


(1A) [If there exist nonzero integers x,y,z such that x? + y? + 2? = 
0, gcd(a,y, z) = 1 and p does not divide z, then there exist relatively 
prime integers t,t,, not multiples of p, such that 


xP + yP 


=? z= tty. 
zty 1> 1 


rty=P?, 
Moreover, t, is odd and t, > 1. 


PRooF. From the hypothesis, x,y,z are pairwise relatively prime. 
Consider the integer 


(1.1) 
Q,(a, —y) = eae = 


Pt ast gery fh se ayes ry? a6 y?'. 
r+y 


Since r+ yt z= a? + y? +z? =0 (mod p) and p/z then p/x+y. 
By Chapter II, (3B), ged(x + y, Q,(x, —y)) = 1. 

From (—z)? = a? + y? = («+ y)Q,(«, —y) we conclude that x + 
y, Q,(x,—y) are pth powers, i.e., there exist integers t,t, such that 
g+y=t, Q,(z,—-y) = 2, so —z = tt and ged(#, t) = 1. 

We show that t, is odd. From (1.1) we see that Q,(x, —y) is the 
sum of an odd number of terms; among these terms, x?~' or y?~+ is 
odd (because x, y are not both even). Thus Q,(x,—y) must be odd, 
hence t, is also odd. Finally, from (1E) since x > y (or y > x) then 
n—y>1 (or y—a > 1), hence t? = Q,(2,—-y) = Q,(y,—2) > P, 50 
t, > 0 and in fact t,; > 1. 


If x,y,z satisfy x? + y? + z? = 0, if p does not divide x,y, z and 
if gcd(x, y, z) = 1 then by the previous result there exist integers 
r, 8,t,71,51,t,, not multiples of p, such that 


aty=t, (a +y)/(@+y)=tt, 2 
(1.2) ytz=7T, (y? “Te 2”) /(y a; z) — ie t= —TT), 
ztaaet, (P+ariztay=s, y 
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Moreover, r,s,t,71,51,t; are pairwise relatively prime, 71, 5,,¢, are 
odd and greater than 1. We note that r? + 5? +t? = 2(a+y+z) £0. 
By addition and subtraction, it follows that 


y = —sP + (rP + 5? + tP)/2 = (r? — 5? +t?) /2, 


x= —rP + (r? + 5? + t?)/2 = (—r? + 8? + t”)/2, 
(1:3) 
z= —tP + (r? + 5? + tP)/2 = (r? + 8? — tP)/2. 


We have the following complement of (1A) known to Sophie Ger- 
main and reproduced by Legendre (1823). Proofs were also given 
by Fleck (1909), Lind (1910), Pomey (1923) and again by Spunar 
(1928), James (1938), Pérez-Cacho (1958), and Draeger (1959) (in a 
different form). 


(1B) [fp is an odd prime not dividing z, every prime divisor q of 
t, is congruent to 1 modulo 2p. In particular, t; = 1 (mod 2p). Tf, 
moreover, p does not divide xyz, then every prime divisor of 715, t, is 
congruent to 1 modulo 2p”. In particular, r, = 1 (mod 2p”), s; = 1 
(mod 2p?) and t, = 1 (mod 2p’). 


PRooF. Let qg be a prime dividing t,. Then q divides x? + y? but 
q does not divide « + y = t? because ged(t,t,) = 1. By Chapter II, 
(3G), g=1 (mod p). Since g — 1 is even then g=1 (mod 2p). 

Now we suppose that p / xyz and that q is a prime dividing r), 
hence q divides x. Therefore q does not divide yz. We note also that 
gcd(r,ri) = 1, so q does not divide y + z. 

Thus we have y = t? (mod q), z = s? (mod gq), hence t? + 5s? = 
y+z#0 (mod q) and t? 4 gP = yP + 2? = —2? =0 (mod q). So q 
is a primitive factor of t? +s”. By Chapter II, (3E), q is a primitive 
factor of t2”” — s?””, hence by Chapter II, (3G), ¢g=1 (mod 2p”). In 
particular, r; = 1 (mod 2p’). 

The proof is similar for the prime factors of s; and ¢,. 


Now we give the relations which must be satisfied by would-be 
solutions in the second case; these facts (including n > 2) were 
known to Sophie Germain and were given by Legendre (1823). 


(1C) Let x,y,z be nonzero integers such that p divides z, x? + 
y? + 2? = 0 and gcd(z,y,z) = 1. Then there exist an integer n > 2 
and pairwise relatively prime integers r,s,t,7T1,81,t1, not multiples 
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of p, such that r,,81,t, are odd and greater than 1 and satisfy the 
relations 


(1.4) 
ety =prrite, (2? + y?)/(a@+y) = pth, 2=—p'th, 
YPeS i, (y?+2)/ytz)=r?, v= rn, 
zt+a=s?, (2? + a?)/(z+2)=s1, y=—S85). 


Moreover, if q is any prime dividing t, then q = 1 (mod p?); in 
particular, t, = 1 (mod 2p’). 


PROOF. 2,y,z2 are pairwise relatively prime. If p divides z then p 
does not divide x nor y, and fom r+ y+2= a? +y? +2? =0 
(mod p) it follows that «+ y= —z=0 (mod p). 

Let m > 2 be such that x + y = p™~!t’, where p does not divide 
t’. Let Q,(x, —y) = (x? +y?)/(a+y). Since p divides u= x+y then 
by Chapter IT, (3B)(6), vp (Q,(x, —y)) = u,»(p) = 1. So Q,(x, -y) = 
pti', pti’. By Chapter I, (3B)(4), gcd(x + y, Q,(x, -y)) = p, so 
gcd(t’, ty’) = 1. 

Since —z? = x? + y? = p™t’t;’ then by unique factorization p 
divides m, and ¢’, t,' are pth powers of integers. We may write 
m=opn, 


gP + y? 
= pti, 
u+y 
z= —p" tt, 


where n > 1, t,t, € Z, ged(t,t,) = 1, p does not divide ¢ nor f¢;. 
Since px and p Jy, by (1A) there exist integers r,71, 5,5, not 
multiples of p, such that 


ytz=r", (y?+2)/y+z)=r7, v=-rn, 


2+ = 5", (2? + 2?)/(z+2)=8!, y = —851, 


and gcd(t, r,s) = gced(t1,71, 51) = 1, ged(r, ri) = ged(s,5,) =1. The 
proofs that r,,s,,t, are odd and that r; > 1, s; > 1 are the same as 
in (1A). 

Now we show that t; > 1. By Chapter II, (3D), pt? = (a? + 
y?)/(«@ + y) = Qp(x,-y) = Qp(y,-x) = p, hence t; > 1 and it 
suffices to show that t; 4 1. If t; = 1 then again by the same result, 
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assuming for example x > y, this would imply: p = 3, x = 2, y= 1, 
hence 23 + 13 + 23 = 0 which is impossible. 

Now we show that if q is any prime factor of t, then gq = 1 
(mod p?). We have z = 0 (mod q) hence y = r? (mod q), x = s? 
(mod q) and 0 = —a? + y? + z? =r?’ +s” (mod q). On the other 
hand, q does not divide r? + s? (otherwise q divides x + y hence q 
divides t, contrary to the fact that gcd(t,t,) = 1). It follows from 
Chapter II, (3D), that g = 1 (mod p*). This implies that t; = 1 
(mod 2p”). 

It remains to show that n > 2. In fact, r? + s? = 2z+ (x+y) = 
—2p"tt, + p?’-'t? = 0 (mod p). By Chapter I, (3H), r? + s? = 0 
(mod p?). Since p | tt, it follows that 2z = (r? + s?) — p?r—!t? = 0 
(mod p?), son > 2. 


From (1.1) we deduce that r? + s? + pP?—'t? = 2(a+y+z) #0. 

The fact that the exact power of p dividing 7+y is p?”~+ has been 
proved again and again (even in 1955 by Stone) by authors unaware 
that this result has been known for a long time. 

We write the relations analogous to (1.3), assuming that p | z: 


(1.5) 
x= —rP + (r? + 5? + pert tP)/2 = (—r? + sP + perl tP) /2, 
y = —s? + (r? + 8? + pert?) /2 = (r? — sP + pre it?) /2, 
z= —pPr“"4eP +e (r? + 5? + phe lt?) /2 = (r? + 5? — perl tP) /2. 


In the case of a squarefree exponent it is still possible to indicate 
some relations which are reminiscent of the Barlow relations; see 
Stewart (1977): 


(1D) I[fn> 2 is asquare-free integer, if x,y,z are nonzero pairwise 
relatively prime integers such that x” + y” = z” (respectively, if n 
is odd and x” — y” = 2”), then z—y = 2“d"~'a” (respectively, 
z+y = 2"d"~-!a") where a,d are natural numbers, u is equal to 0 or 
1, and 2" and d divide n. 


PROOF. We first consider the case where x” + y” = z” and we write 
z—y=a'a” where a,a’ > 1 and for every prime p the p-adic value 
of a’ is Up(a’) <n. 

If p is a prime dividing a’ then p divides n. Otherwise, p / n 
and by Chapter IT, (3B)(5), p/ Qn(z,y) = (2" — y")/(z — y). So 
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vp(a’) + nvy(a) = vp(z — y) = Vp(2” — y”) = vp(x") = nv,(y). Hence 
n divides v,(a’) and n < v,(a’), which contradicts the hypothesis. 
This shows that p | n, so v,(n) = 1, because n is square free. 

Now we determine v,(a’) when p divides a’. First, let p be an 
odd prime. By Chapter II, (3B)(6), vp (Qn(z,y)) = up(n) = 1. So 
v,(a’)+nv,(a)+1 = vp(z—y) +p(Qn(Z,y)) = Up(2"—y") = Vp(z") = 
nv,(«x); hence v,(a’) = —1 (mod n) and therefore v,(a’) =n —1. 

Now let p = 2. If 4 | z—y, by the result already quoted, Part 
(7), we have vo(Qn(z, y)) = ve(n) = 1, so as above we conclude that 
v,(a’) =n—1. If 2 | z—-—y but 4/2 —y then v2(a’) + nve(a) = 
U2(z — y) = 1, so v2(a’) = 1. There remains the possibility that 2 
does not divide z — y. 

Putting these facts together, we may write z — y = 2%d"~1a”, 
where u = 0 or 1, 2“ divides n, and d”~' divides n. Now let n be 
odd and x” — y” = z”. Then x” + (—y)” = 2” and by the first part 
of the proof z+ y has the form indicated. 


In particular, if n is a square-free integer, n > 2, if x,y,z are 
nonzero pairwise relatively prime integers such that 2” + y” = 2”, 
then 

gS 0" dak, 

ZY ies Ts Magis ae 
and, moreover, if n is odd then 

eyes dy at 


where Qj, @2, 43, d,, dz, dz; are natural numbers, uy, U2, u3 are equal to 
O or 1, and 2”, 2%, 28, d,, dy, d3 divide n. 
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III.2. Secondary Relations for Hypothetical Solutions 


In 81 we have seen that if x,y, z are nonzero pairwise relatively prime 
integers, p an odd prime and if x? + y? + z? = 0 then the Bar- 
low relations must be satisfied; in particular, there exist integers 
r,8,t,11,51,t, satisfying certain properties. In this section, we give 
further properties which must be satisfied by these integers. 

If m,n are nonzero integers, gcd(m,n) = 1 and n is odd, let (m/n) 
denote the Jacobi symbol. 

The following consequence of Barlow’s relations was first indicated 
by Pierre in 1943. 


(2A) Let x,y,z be nonzero relatively prime integers such that x? + 
y+ 2? =0. 
(1) Ifp)axyz then 


t 
Coe cr gee 
81 ty try T1S1 


(2) Ifp|z then 
(=) Ss, 
T7181 


ar) tee 
pst, ptyry 


ProoFr. (1) To begin, we note that the polynomial pXY(X + Y) 
divides (X + Y)? — (X? + Y”) (in Z[X, Y]) (see also Section VII.2). 
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By (1A) we have (a + y)?- — t? = 0 (mod pry). Then t? = 
(a+ y)?-! (mod 1181), hence 


Gis) = (i) 

T1$81 T1$1 

Ga) ea) 
Sit, ryt, 


(2) Ifp|z, by (1C) we have (x + y)?~' — pt? = 0 (mod pry). 
Hence pt} = (x+y)?! (mod r;s,). Therefore, 


P 
(=) = (=) SM, 
T7181 81 
As in the proof of (1), (y+z)?-'—r? =0 (mod pry). Therefore 
re =(y+z)?"' (mod psity), 


Ga ae) 
psyty pst; 
(=) =41. 
prity 
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IV 


Germain’s Theorem 


In this chapter we give the beautiful theorem of Sophie Germain for 
the first case of Fermat’s theorem. 


IV.1. Sophie Germain’s Theorem 


Sophie Germain, a French mathematician, contemporary of Cau- 
chy and Legendre, proved a very beautiful theorem of an entirely 
new kind which established “d’un trait de plume” (in Legendre’s 
expression) the first case of Fermat’s theorem for every prime p < 
100. Her method is still being explored by other mathematicians. 
Legendre developed 5S. Germain’s ideas in his paper of 1823. 

We begin with an easy observation; (3) was also given by Bang 
(1935). 


(1A) Let gq be a prime and n > 3 be an odd integer. The following 
statements are equivalent: 


(1) There exist integers a,b,c, not multiples of q, such that a” + 
b” + c* =0 (mod 4g). 
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(2) There exist integers d,e, not multiples of q, such that d” = 
e” +1 (mod gq). 
Moreover, if q— 1 = 2kn, the above statements are equivalent to: 


(3) There exist roots u,u’ of the congruence X**—1=0 (mod q) 
such that u'’ =u+1 (mod q). 


PRroorF. (1) — (2) Since g /c there exist integers d,e € Z such that 


dc = —a (mod q), 
ec = b (mod q). 


Then q /de, (dc)” = (ec)” +c” (mod q), so d” = e” +1 (mod q). 

(2) — (1) This is trivial. 

Now we assume that q— 1 = 2kn. 

(2) > (3) Let u=e”, u’ = d”, then u** =e?! =1 (mod q) and 
similarly (u’)?* = d?-' = 1 (mod q), with u’ =u+1 (mod q). 

(3) — (2) Let h be a primitive root modulo g. Let u = h”, 
so h?*™ = uy? = 1 (mod q), hence q— 1 = 2kn divides 2km, so n 
divides m. Thus u = e” (mod q). Similarly u’ = d” (mod q) and 
d” =e” +1 (mod q). 


Now we give Legendre’s version of Sophie Germain’s theorem:! 


(1B) Let p,q be distinct odd primes and assume that the following 
conditions are satisfied: 


(1) Ifa,b,c are integers such that a? + b? +c? =0 (mod q) then 
q | abc. 
(2) p is not congruent modulo q to the pth power of an integer. 


Then the first case of Fermat’s theorem is true for the exponent p. 


PRooF. Let x,y,z be pairwise relatively prime integers, not multi- 
ples of p, such that x? + y? + z? = 0. Then x? + y? + z? =0 (mod q) 


See footnote, p. 13 of Legendre’s paper of 1823, where he wrote: “This 
proof which, one has to agree, is very ingenious, is due to Mlle. Sophie Ger- 
main, who cultivates with success the physical and mathematical sciences, 
as witnesses the prize she has been awarded by the Academy for her paper 
on vibrations of elastic blades. She is also the author of the proposition 
in art. 13 as well the one which concerns the particular form of the prime 
divisors of a, given in art. 11.” [Here, these correspond to propositions 


(2B), (2C)).] 
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and by hypothesis (1), q | yz. We may assume, for example, that 
q| x, hence q / yz. 

Since p ) xyz there exist integers r, s,t, 11, $1, ¢, satisfying the rela- 
tions (1.2) and (1.3) of Chapter III. Since q | x then —r?+5?+t? =0 
(mod q). By hypothesis (1), g divides one of the integers r, s,t. Since 
s divides y, t divides z, and g does not divide yz and gq does not di- 
vide st, so q divides r. But t? = (a? + y?)/(a + y) = y?' (mod q), 
because q | 2. 


Since q divides r, so y = —z (mod q). Hence 
P + zP 
re yt ey (2) pt (a 
ytz 


= py’ = pt? (mod q). 


Since t; # 0 (mod q) there exists an integer t’ such that t/t; = 1 
(mod q), hence p = (t’r;)? (mod q), which contradicts the second 
assumption. 


Before proceeding, we comment on the above conditions. 

In the next section, we shall introduce the Wendt determinant, 
which will serve to test the existence of integers x, y, z, not multiples 
of q, such that x? + y? + z? =0 (mod q). 


(1C) Ifpandq are odd primes and q—1 = 2pk, k a natural number, 
then condition (2) of (1B) is equivalent to each of the following: 


(2’) (2k)?* £1 (mod q); and 
(2) p* #1 (mod q). 


PROOF. We show first that (2) implies (2’). Let h be a primitive 
root modulo gq and let p = h* (mod q). If (2h)** = 1 (mod q) then 
h2ks = p?® = (2k)?*p?* = (2kp)?* = (q — 1)?* = 1 (mod q); hence 
q—1 = 2kp divides 2ks so p | s and p = a? (mod q) with a = h'/? 
(mod q). 

Now we show that (2’) implies (2”). If p?* = 1 (mod q) then 
(2k)?* = (2k)?*p?* = (q— 1)?* =1 (mod p). 

Finally, we prove that (2”) implies (2). If there exists an a such 
that p =a? (mod q) then p?* = a?*? = at! =1 (mod q). 


In Table 6 (see Legendre, 1823), we indicate, for each p < 100, 
the choice of g, of a primitive root h modulo qg and the set R of 
residues of pth powers, modulo g. The computations, which are 
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quite lengthy, are done using the primitive root modulo gq. They 
establish condition (2). 

For larger values of p the computations become forbidding. How- 
ever, a glance at the table reveals that in each case, g has been chosen 
to equal 2p+1, or 4p+1, or 8p +1, or 10p+1, or 14p+1, or 16p+1. 
Indeed, the following corollaries of Sophie Germain’s theorem hold 
(Legendre, 1823): 


(1D) [fp is an odd prime and q = 2p +1 is also a prime, then 
the first case of Fermat’s conjecture is true for p. 


PROOF. We show that q satisfies the assumptions of (1B). 

If x,y,z are integers not multiples of gq and x? + y? + z? = 0 
(mod q), then from p = (q —1)/2 we have x? = +1 (mod q), y? = 
+1 (mod gq), z? = +1 (mod q). Hence 0 = 2? +y?+z? =+1+141F 
0 (mod q), a contradiction. 

Similarly, if p = a? (mod q) then condition (2’) is not satisfied, so 
2p +1 =q divides 2? — 1 = 3, which is absurd. 


Legendre extended this criterion (1823): 


(1E) [fp isa prime, p>3 andq=4p+1, orqg=8p+1, orqg= 
10p +1, org = 14p+ 1, org = 16p +1 is also a prime, then the 
first case of Fermat’s theorem is true for the exponent p. 


PROOF. We show that in each case q satisfies the assumptions of 
(1B). 


Case 1: Let q= 4p +1. 

If p = a? (mod q) then by condition (2’) above, 44 = 1 (mod q) 
so 4p + 1 = q divides 255 = 3 x 5 x 17, which is absurd. 

For the first condition, let w be a primitive fourth root of 1, mod- 
ulo q. So {1,w, w?, w?} are the roots of X* — 1 = 0 (mod q), and 
w? = —1 (mod q), w* = —w (mod gq). If the first condition is not 
verified, then by (1A) there exist i 4 7,0 < i,7 < 3, such that 
wi = w'+1 (mod q). Apart from trivial cases, this leads to one of 
the following possibilities: w = +2 (mod q) or 2w = +1 (mod q). 
Raising to the square it follows that qg = 5, which is absurd. 


Case 2: Let q = 8p +1. 
If p = a? (mod q), then proceeding as before, 8p + 1 = q divides 
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TABLE 6 

p q h R 

3 7=2x34+1 3 +1 

i) 11=2x541 2 Pek 

7 | 29=4x7+1 (2 /+1, +12 

11} 23=2xi1l+1)5 |+ 

13} 53=4x 1341 | 2 1, +23 

17|137=8x17+1 [3 |+1, £10, +37, £41 

19/191 =10 x 19+1]|19|/ +1, £7, £39, £49, £82 

23} 47=2x 2341/5 |+1 

29) 59=2x 294+ 1/2 J+ 

31 | 311 = 10 x 31 +1] 17/41, £6, £36, £52, £95 

37/149=4 x 374+ 1 |2 1, +44 

41) 88=2x414+1/2 |+ 

43 |}173=4x 484+ 1 | 2 1, +80 

47|659 = 14x 4741/2 |+1, £12, £55, £144, £249 
270, +307 

53|107=2 x 5384+ 1 2 pai 

59 |827=14x59+1/2 | +1, £20, £124, £270, £337 
389, +400 

61/977=16 x 61+ 1/3 | +1, £52, +80, £227, £252 
357, +403, +439 

67 /269=4x67+1 /2 | +1, +82 

71/569=8x71+1 /3 |+1, £76, £86, £277 

73|2983=4x73+1 [2 |+1, £138 

79/317=4x79+1 [2 |+1, +114 

83 | 167 = 2 x 8384+ 1 5 +1 

89 |} 179 =2 x 89+ 1 2 a 

97/389 =4x97+1 |2 1, +115 
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8° — 1, so it divides 8* — 1 = 4095 = 37x 5x 7x 13 0r 84+1= 
4097 = 17 x 241, which is an absurdity. 

If the first condition is not verified, if w is a primitive eighth root 
of 1 modulo gq, there exist i 4 7,0 <7, 7 < 7, such that w? = w'+1 
(mod q). ee w* = —1 (mod q) we may consider the congruences 
w) = +w'+1 (mod q), with 0 <i < 3. The primitive eighth roots 
of 1 modulo q being tw, +w?, and w? = —w7' (mod q), we are 
reduced to study the following possibilities: 

42) (niod).g) (= 15.2); 
w+1 (mod q); and 

(iii) 2w = +1 (mod gq). 

We discuss the various cases. 

(i) Raising to the fourth power leads to q | 15 or g = 17, an 
absurdity. 

(ii) Raising to the square, 


w+1+2w+1=3w+2 (mod q), 
w—1-—2w+1=-—w (mod gq). 


-lsutt2w+1={ 


Hence w = +1 (mod q), an absurdity. 
(iii) Raising to the fourth power, g = 17, absurd. 


Case 3: Let q= 16p + 1. 

If p= a” (mod q), then with the same method, 16p+1 = q divides 
16°© —1 = (168 + 1)(16* +1)(16? +1) x 17x 15. Clearly 16p+ 1 does 
not divide 15, 17, 257. If 16p +1 divides 164 + 1 = 65537, which 
is a prime (in fact a Fermat prime, 2'° +1), then p = 16°, which 
is absurd. If 16p + 1 divides 16° + 1 = 232 +1 = 641 x 6700417 
(decomposition into primes given by Euler), then p = 40 or 418776, 
again absurd. 

If the first condition is not verified, if w is a primitive sixteenth 
root of 1 modulo q, since tw, tw?, we tw are all the primitive 
sixteenth roots of 1 modulo gq, and since w® = —1 (mod q) there exist 
i,j,0<i<j <7, such that wi = +w’ re | q). This leads to 
one of the following congruences, with all possible sign combinations: 


(i) tS 2 (raed Gf) {7 12 oe Os 
(ii) w? =w+l1 (mod q); 
(iii) w? = +w i 1 (mod q); 
(iv) wt=w+tl a q); and 
(v) wt = +w? +1 (mod q). 
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We discuss the various possibilities. 

(i) Raising to the eighth power leads to q | 257 or q | 255 = 
3 x 5 x 17, an absurdity. 

(ii) Raising to the square leads to wt = —w (mod gq), which is 
impossible, or w* = 3w +2 (mod q). Squaring again, this leads to 
—1=9w?+12w+4 (mod gq), and substituting w? = w+1 (mod q) 
we obtain 3w = —2 (mod q). Hence wt = 3w +2 = 0 (mod q), 
which is impossible. 

(iii) Raising to the cube: 


—w =+w'?4+3w?+3w+1 (mod gq) 


and substituting, 


—w=wtl4+3w?+3w+1 (mod q). 


According to the choice of the sign, we have 3w? + 5w+2 = 0 
(mod q) or 3w? — w = 0 (mod q). 

In the first case, multiplying with w and substituting, we obtain 
5w? + 5w +3 = 0 (mod q); subtracting, 2w? = —1 (mod q) and 
therefore —2+ = 1 (mod q), so q | 17, impossible. If 3w = 1 (mod q), 
then 3w? = —3w + 3 = 2 (mod q) so raising to the cube, —3°w = 8 
(mod q), hence q divides 17, an absurdity. 

(iv) Squaring: —1 = w?+2w +1 (mod q), so w? = F2w — 2 
(mod q), hence w +1 = wt = 4w? + 8w +4 (mod q). This gives, 
according to the choice of sign, 


and subtracting, 
= { ter (mod q). 


So w4 = wt1 = —4 (mod gq) and raising to the square, —1 = 16 
(mod q) so q | 17, impossible. 

(v) Raising to the square, —1 = w* + 2w? + 1 (mod q) and 
substituting wt = +w? +1 (mod gq), in all cases we obtain w? = +1 


(mod q), which is impossible. 


Case 4: Let gq = 10p +1. 
If p = a? (mod q), by the above method 10p + 1 = q divides 
(10° + 1)(10° — 1). If q divides 10° + 1 = 100001 = 11 x 9091 (this 
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last number is a prime) then p = 909, an absurdity. If q divides 
10° — 1 = 99999 = 3? x 41 x 271 then p = 4 or 27, an absurdity. 

If the first condition is not verified, if w is a primitive tenth root 
of 1 modulo q, then there exist 7,j7,0 <i< Jj < 4, such that wi = 
w' +1 (mod q). The above conditions lead to either one of the 
following congruences: 

(i) wi =+2 (mod q) (j = 1,2,3,4); 
(ii) w? = w+1 (mod q); and 
(iii) wt = w+t1 (mod q). 


We discuss the various possibilities. 

(i) Raising to the fifth power leads to q | 31 or q| 33 = 3 x 11, 
impossible. 

(ii) We have 


3w+2 
—4 


wi=w?+2wt+l=wtl 2w+1={ (mod q). 
The second case is not possible. In the first case multiplying with 
w and substituting —1 = 3w? + 2w = 3w+3-+ 2w (mod q) so 
5w = —4 (mod q). Multiplying with the previous congruence, —5 = 
—12w — 8 (mod q) so 12w = —3 (mod q) and 4w = —1 (mod q) 
hence subtracting, w = —3 (mod q) and therefore q | 11, absurd. 

(iii) Multiplying with w: —1 = w?+w (mod gq), so w? = F-w-1 
(mod q) and this was considered in case (ii). 


Case 5: Let q= 14p + 1. 

If p = a? (mod q), we see in the same way that q divides 147 + 
1 or 147-1. But 147+ 1 = 105413505 = 3 x 5 x 7027567 (this 
last number is a prime). Then p = 501969, an absurdity since this 
number is a multiple of 3. Also, 147-1 = 105413503 = 13x8 108731 
(this last number is a prime). Then p = 579195, which is absurd. 

If the first condition is not verified, if w is a primitive fourteenth 
root of 1 modulo q, then there exist 1 4 j, 0 < i, 7 < 18, such that 
wi =w'+1 (mod gq). Since w’ = —1 (mod q), the above conditions 
lead to either one of the following congruences: 


(i) w? =+2 (mod g) (j = 1,2,... ,6); 
(ii) w? =w+l1 (mod q); and 
(iii) w* = +w +1 (mod gq). 


We discuss the various cases. 
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+27 (mod q), so q | 127 or 129 = 3 x 43, which gives 
is was excluded. 
ii) w?=w+l (mod q) gives 


wi=w?+2wt+l=wtl 2wtis{ et? (mod q), 


the second case being impossible. Squaring, 
—w=9w?+12w+4=9w+9+12w+4 (mod q), 


so 22w=—13 (mod q). Then 22w? — 22=-—13 (mod q) so 22w?=9 
(mod q). Then —13w=9 (mod q) and from this we obtain 9w=—4 


(mod q), —4w = 5 (mod q), 5w = 1 (mod q); so 25 = —20w = —4 
(mod q), hence q | 29, which is impossible. 
(iii) w*® =+w-+1 (mod q) gives, to the cube: 
—w’? = tw? 4+ 3w? + 3w+1 


= +wt+14+3w*+3w+1= 3w? +4w +2 (mod q), 


hence 2w?+2w+1= 0 (mod gq) so 2(4w+1)+2w?+w =0 (mod q) 
and 

0 (mod q), 

0 (mod q), 


bo db 


2Qw? — w 


{ Qw? + 3w + 


hence 


—3w 4 0 (mod q), 


which is impossible. 


{ a, (mod q), 
), 


and this gives w = +1 (mod q 


With this criterion, Legendre had actually shown that the first 
case of Fermat’s theorem holds for every prime exponent p < 197. 
Indeed, for each such prime p, there exists a prime q = 2kp+1, with 
2k € {2,4,8,10,14,16}. On the other hand, 38 x 197 + 1 = 7487 is 
a prime, but 2k x 197+ 1 is not a prime if 2k < 38, 6 2k. 

The limitation in Legendre’s results was due to the size of the 
numbers involved. For example, to test whether p = 197 is not a pth 
power modulo q = 7487, would lead to find whether 7487 divides 
3819 + 1. Maillet extended Legendre’s result in 1897, pushing the 
limit up to p = 211. 

Mirimanoff used a method involving Bernoulli numbers, in 1905, 
to extend the results to 257. 
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In 1908, Dickson published two papers in which he explored Legen- 
dre’s ideas and with a more careful analysis involving congruences, 
he showed that the first case of Fermat’s theorem holds for every 
prime exponent p < 7000 (with the exception of p = 6857, which he 
did not take the trouble to examine). See also Maillet’s comments 
(1908). 

More progress along this line was made by Krasner (1940), Dénes 
(1951), and Rivoire (1968). 

For the primes p such that 6p + 1 or 12p+ 1 is also a prime, the 
method of proof breaks down and does not lead to any conclusion. It 
should be noted that in 1974 Gandhi announced without proof that 
if p and 6p+1 are primes then the first case holds for p; since no proof 
has ever been published, there is reason to doubt of the justification 
of the statement. In this connection, we quote the paper by Granville 
and Powell (1988). 

An interesting, but very difficult question is whether there exist 
infinitely many primes p such that 2p + 1 (or 4p +1, or 8p 4+ 1, 
etc...) is also a prime. We discuss this problem in the Appendix to 
this chapter. 

We still note here the following result of Vandiver (1926) : 


(1F) Let p and q = 2kp +1 be odd primes (with k > 1). If 
2k = 2°p", where h > 0 and p does not divide v, and if 2 is not a 
pth power modulo q, then condition (2) above is satisfied. 


PROOF. We show (2’). If (2k)?* =1 (mod q) or equivalently p** = 1 
(mod p), then 22% = Q?hup2kh _ (Q%pr\2k = (2k)?* = 1 (mod q). 
Since p does not divide v, there exist integers a,b such that av = 
1+ bp. Then 


l= g2kva = 9 (1+bp) 2k = oa = Dh geeny? = 92k (mod q). 
If g is a primitive root modulo q and 2 = g* (mod q), then 1 = 


22k = g?ks (mod q). So g—1 = 2kp divides 2ks, hence s = ps’ and 
2 =(g° )” (mod q), which is a contradiction. 


Using (1B), Vandiver deduced in 1926 the following result which 
however had been proved by Wendt in 1894, using his form of Sophie 
Germain’s theorem: 
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(1G) [fp andq=2kp+1 are odd primes, with 2k = 2°p", h > 0 
and v not divisible by p, if the congruence X?+Y?+ Z? = 0 (mod q) 
has only the trivial solution, then the first case of Fermat’s theorem 
holds for the exponent p. 


Proor. By (1B) and (1F) it suffices to show that 2 is not a pth 
power modulo g. If 2 = a” (mod q) then a? + (—1)? + (-1)? = 
2+ (—1) + (-1) = 0 (mod gq) contrary to the hypothesis. 


Sophie Germain’s theorem, corollaries and variations were redis- 
covered by several authors. In 1953, Thébault proved: 


(1H) If m > 2 is an integer such that 2m +1 is a prime, if 
there exist pairwise relatively prime nonzero integers x,y, z such that 
a” +y™” = z™ then 2m+1 divides xyz. 


PrRooF. If 2m+1 does not divide x then by Fermat’s little theorem 
x?" = 1 (mod 2m +1) hence x” = +1 (mod 2m + 1). 
Similarly 


y” = +1 (mod 2m+ 1), 
z™ = +1 (mod 2m+1), 


hence 0 = 2” + y™ — z™ = (41)4+ (+1) — (41) (mod 2m+ 1), which 
is impossible. 


This same result (even with the further hypothesis that m be 
prime) is proved again by Stone in 1963 and Gandhi in 1966, in the 
same journal! 

Gandhi showed, also in 1965, a result similar to Thébault’s: 


(11) Ifm> 2 is an integer such that 4m+1 is a prime, if x,y, z are 
nonzero pairwise relatively prime integers such that 2” + y™ = z™ 
then 4m +1 divides xyz. 


ProoF. If m= 3, the statement is trivially true, by Chapter I, 84. 

Let m > 3 and assume that 4m + 1 does not divide xyz. From 
xc” +y™ = z™ it follows that 2?"+ y?2"+2ar™y™ = z?™, Since 4m+1 
is a prime not dividing x then by Fermat’s little theorem x*” = 1 
(mod 4m +1), so 7?” = +1 (mod 4m +1). Similarly 


yy" =+1 (mod 4m+ 1) 
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and 


2™=+41 (mod 4m+1). 


Hence +1 +1 +4 22y™ = +1 (mod 4m +1) so 2a™y™ = +1 or +3 
(mod 4m + 1) and +4 = 4x?"y?” = 1or9 (mod 4m+1). This 
implies that 4m + 1 = 3,5, or 18, hence m = 3, which is a contra- 
diction. 


This same result (with the further hypothesis that m is prime) is 
proved again by Gandhi in 1966 and 1970 and by Christilles in 1967. 
Perisastri (1969) proved: 


(1J) Ifp>51 is a prime such that 8p+1 is also a prime, if x,y, z 
are nonzero pairwise relatively prime integers such that x? + y? = z? 
then 8p +1 divides xyz. 


(1K) IJIfm>3 is an integer such that 3m+1 is a prime, if x,y,z 


are pairwise nonzero relatively prime integers such that x™ + y™” = 
z™, then 3m+1 divides xyz. 


Krishnasastri and Perisastri proved in 1965: 


(1L) Jfp is an odd prime, if x,y,z are integers such that x? +y? = 
z? and p does not divide xz, then there exists an integer k > 1 such 
that 1+ kp divides z. 


Combining (1C) with Sophie Germain’s theorem, we have (see 
Stone (1963), Perisastri (1968)): 


(1M) Let p and 2p +1 be odd primes. If x,y,z are nonzero, 
pairwise relatively prime integers such that x? + y? + z? = 0, then p 
divides one (and only one) of the integers x, y, z. 


PROOF. By Sophie Germain’s theorem we may assume, for example, 
that p divides z. By (1C), p? divides z. 


Pomey obtained in 1923 and 1925, with similar methods, several 
sufficient conditions for the first case of Fermat’s theorem for the 
prime exponent p: 
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(1N) Let p be an odd prime and assume that either one of the 
following conditions is satisfied: 
(a) p=1 (mod 4) and 2p +1 divides 2? +1. 
(b) p=3 (mod 4) and 2p +1 divides 2? — 1. 
(c) 4p +1 divides 27? +1. 
(d) 4p +1=5 (mod 12) and 4p+ 1 divides 3°? +1. 
) 
) 


8p +1 divides 24? — 1. 
10p +1 divides 5°? — 1. 


Then the first case of Fermat’s theorem is true for the exponent p. 


All the above results do not suffice to conclude that there exist 
infinitely many prime exponents p for which the first case of Fermat’s 
theorem is true. This was first proved, with analytical methods, in 
1985 by Adleman and Heath-Brown and and by Fouvry. 

Earlier, in 1897, studying the class group of the cyclotomic field, 
Maillet showed that for every odd prime p there exists an exponent 
e (depending on p) such that the first case of Fermat’s theorem is 
true for the exponent p*. In particular, this implied the existence of 
an infinite set of pairwise relatively prime exponents for which the 
first case is true. This last statement was proved again by Kapferer 
in 1964. His proof was not elementary, since it used the theorems of 
Furtwangler, as generalized by Moriya (requiring class field theory). 
In 1978, Powell discovered independently the following very simple 
proof: 


(10) 
(1) If p is any odd prime, n = p(p—1)/2 = 2"m where u > 0, m 
is odd, if x,y,z are nonzero integers such that x*+y"+z”" = 0 
then gcd(m, xyz) #1. 
(2) There exists an infinite set of pairwise relatively prime expo- 
nents for which the first case of Fermat’s theorem is true. 


Proor. (1) If p = 3 then n = 3 and the hypothesis is not satis- 
fied. Let p > 3. Suppose that gcd(m, xyz) = 1. Then p J xyz so 
g-1)/2 = +1 (mod p) and 2” = +1 (mod p). Similarly y” = +1 
(mod p), z” = +1 (mod p), hence x” + y” + z” #0 (mod p) and a 
fortiori, c” + y" +2” £0. 

(2) Assume that ni,..., x are pairwise relatively prime expo- 
nents for which the first case of Fermat’s theorem is true. Consider 
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the arithmetic progression {—1 + 4n,n2---nzt | t = 0,1,2,...}. By 
Dirichlet’s theorem on primes in arithmetic progressions there exists 
an odd prime p such that p = —1 (mod 4n;n2---n;,). Let nei = 
p(p — 1)/2, so ngy1 is odd. Since ged (p(p — 1)/2, (p + 1)/2) = 1 
then gcd(ng41, 21°--Nz) = 1. By (1), the first case is true for the 


exponent n,41, and this suffices to complete the proof. 
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IV.2. Wendt’s Theorem 


Wendt indicated in 1894 a determinantal criterion for the existence 
of a nontrivial solution of Fermat’s congruence 


(2.1) XP? +4+Y?+4 Z? =0 (mod q), 


where p,q are distinct odd primes. 

To begin, we wish to exclude from our discussion the following 
trivial case; it also holds without assuming the exponent in (2.1) to 
be a prime: 


(2A) Ifq is an odd prime, ifn > 1 is such that ged(n, g—1) = 1 
then there exist integers x,y,z, not multiples of q, such that x” + 
y” +2” =0 (mod q). 


PROOF. By hypothesis, gcd(n, q— 1) = 1, so there exist integers a, b 
such that an + b(q—1) =1. Let 29, yo, Zo be integers, not multiples 
of q, such that xo + yo + 2 =0 (mod q). Then 


xa” = x9 (mod q), 
= yo (mod q), 
= z% (mod q), 


so (ae)" + (y@)” + (z@)” =0 (mod q). 


In particular, if n = p is a prime not dividing q— 1 then (2.1) has 
a nontrivial solution. 
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Wendt’s criterion is expressed in terms of the circulant of binomial 
coefficients. More generally, let n > 1 and let €; = cos2mi/n + 
J/—Isin2ri/n (for i = 0,1,...,2—1) be the n nth roots of 1; so 
& = 1. The circulant of the n-tuple (ao, @1,... ,@n—1) of complex 
numbers a; is, by definition, the determinant of the matrix 


dg Gy «ee Any 
(2.2) ae Gn-1 Ao «+. Gn_2 
Q, dg ... 
We denote it by Circ(ao, a1,... ,,@n—1). The circulant is expressed 


in terms of nth roots of 1 and equally as the resultant of two poly- 
nomials (see Chapter II, 84). Spottiswoode (1853) and also Stern 
(1871) and Muir (1920) proved: 


LEMMA 2.1. Let Qo, @1,--- ,An-1 © K, let G(X) =agt+ a, X feet 
GQn-1X"! and let & = 1,£1,...,€:-1 be the nth roots of 1. The 
circulant of ao,d1,... ,@n—1 1s equal to 


n—-1 


Circ(a9, @1,--- , @n—1) =11G60 (€;) = Res(G(X), X” — 1) 


(where Res denotes the resultant). 


PROOF. Let 
010 =. 0 
001. 0 
A= 
100 ::. O 


(n x n matrix), so I, A,A?,... ,A”~' are distinct and A” = J. As is 
easily seen, 


Cagar eA Het: ete. 


The characteristic polynomial of A is det(X A — J) = X" — 1. Since 
it has distinct roots & = 1,&,...,&—1, then A is diagonalizable, 
that is, there exists an invertible matrix U (with complex entries) 
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such that 
fo 0 0 
0 
UAU Y= ‘1 
0 0 oo 
Hence 
G(f) 0 0 
0 G yt 0 
LUCE a : - . : 5) 
0 0 oats G(En—1) 


where G(X) = agta,X+-+-+an_-1X"~'. So Circ(ao, a1,.-. ,@n—1) = 
det(C) = det(UCU~) = JJ") G(€,). By Chapter II, (4B), we also 
have Circ(do, a1,... ,@n) = Res(G(X), X” — 1). 


The following result of Wendt is about the circulant of binomial 
coefficients. Accordingly, for every n > 1 we define the Wendt de- 
terminant to be 


(2.3) wa micie(1,(7). (Blasi a a) 


IEG X) = 14 ()X4 (YAP (0 AOE = (LEX) X" then by 
the lemma, W,, = Res(G(X), X" — 1) = JJ? [(1+&)" —1]. Now 


we give Wendt’s criterion (1894); see also Matthews (1895), Bang 
(1935), and an expository presentation by Rivoire (1968): 


(2B) Let p be an odd prime and assume that q = 2kp + 1 (with 
k > 1) is also a prime. Then there exist integers x,y, z, not multiples 
of q, such that x? +y?+ 2? =0 (mod q) if and only if q divides W2,,. 


PRroor. By (1A), Fermat’s congruence X? + Y? + Z? = 0 (mod q) 
has a nontrivial solution if and only if the system of congruences 


X?* =1 (mod q), 
{ (X +1)* =1 (mod q), 


has a nontrivial solution, or equivalently, the system of congruences 


X?k _1 =0 (mod q), 
(X + 1)?* — X?* =0 (mod q), 
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has a nontrivial solution. This holds exactly when the resultant of 
the polynomials X?* —1 and G(X) = (X +1)?*-—X7*=14+(?")X+ 
(7) X? + +--+ (,7",)X?*-1 is congruent to 0 modulo g; by Lemma 
2.1 this means that W2, = 0 (mod q). 


Now we turn our attention to the computation of Wendt’s deter- 
minant 


(2.4) - Thi ieee =i. 


The following result was stated without proof by Wendt (1894), and 
proved thereafter also by Matthews (1895), E. Lehmer (1935), Bang 
(1935), and Frame (1980): 


(2C) W, =0 éf and only if 6 divides n. 


Proor. Assume that 6 divides n, and let € = € = cos2a/n+ 
V—Isin27/n. Let 1 = n/3, so w = €' is a primitive cubic root of 1. 
Hence 1+w+w? = 0, therefore 1+ ¢' = —€” and (1+ €')” =1. We 
conclude that W,, = 0. 

Conversely, if W,, = 0 there exists j such that (1+ ;)” = 1, 
so €;, 1+; are nth roots of 1 and since the triangle with vertices 
0, 1, 1 = &, 


gj 1+ & 


0 1 


is equilateral, then 6 = 27/6 (or 6 = —27/6) and 1+; is a primitive 
sixth root of 1. But (1 + €;)" =1 hence 6 divides n. 


As a corollary: 


(2D) If p and 6mp +1 = gq are primes then the congruence 
X? + Y" 4+ Z? =0 (mod gq) has a nontrivial solution. 


PRooF. This follows at once from (2B) and (2C). 
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In view of (2C), it is customary to modify the definition of the 
Wendt determinant when 6 divides n, by putting 


w. =] e&), 
i=0 


where 

(X +1)" — xX” 
X?4 X41 0 

In 1935, E. Lehmer indicated, without proof: 


(2.5) G(X) = 


(2E) [fd divides n then W, divides W,. 


PROOF. We may assume that W, 4 0, i.e., 6 /d. We note that since 
d|n then each dth root of 1 is also an nth root of 1. From Wz 4 0, 
then 


W,, (1+&)"-1 


— = |J ——— « [I [0a+6)”-1. 
Wa e4=1 (1 TT E;)4 = erai,e¢Al 

If n = de then 
(Ons Maree 


deeje=1.~ Ce) ia iee cere Gt) cad 


Therefore W,,/W, is an algebraic integer, but also a rational number, 
hence a rational integer. 


The next property was also indicated by E. Lehmer, without proof. 
A proof (not the one below) appears in Frame’s paper (1980); the 
weaker statement 2” — 1 divides W,, was proved by Bang (1935): 


(2F) Ifn>1 then W,, = (—1)""'(2"—1)u?, where u is an integer. 


PROOF. 


n-1 


We= Ge)" =); 


j=0 


where € = cos2a/n+ /—ILsin2a/n. Thus W,, has the factor 2” — 1 
(when 7 = 0) and if n is even, also the factor —1 (when j = n/2). 
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So 
We (12? =) TT ee) aa: 


j#0,n/2 

Let u = Tocjensal(1 + &7)” — 1]. We note that for every 7 4 0, n/2 
we have (1+ €-4)"—1 = [€-4(1+ &)]"-1 = (14+ &)"—1 so wis real 
and W,, = (—1)""'(2” — 1)u?, and it remains to show that u € Z. 
Let o be any automorphism of the field Q(£€), so o(€) = €' where 
1<l<nand gcd(l,n) = 1. Hence o [(1 + &’)" —1] = (14+ €*)” 

where 0 < k < n/2 and jl = +k (mod n) because (1+ €-")"-—1= 
(1+€*)"-1. If0 <j, 7’ < n/2, let k, k’ be such that 0 < k, k’ < n/2 
and jl = +k (mod n), 7/1 = +k’ (mod n). We note that if 7 4 7’ 
then k # k’, because if jl = +j’l (mod n) then j = +)’ (mod n) 
and this would imply 7 = 7’. Therefore, different factors of u have 
distinct images by o, so a(u) = u hence u € Q, being invariant by 
the automorphism of Q(£€). But wu is also an algebraic integer, hence 
we Z. 


For n even, Frame proved (1980): 


(2G) Ifn=2m and3/n then W, = —3((2” — 1)/3)%u® where 
u is an integer. In particular, if p is a prime, p = 5 (mod 6), then 
W,-1 = —3((2?-1 — 1)/3)%u®, where u is an integer. 


PRooF. Let n = 2m. Since 3 /'n, p = €° is also a primitive nth root 
of 1. From p™ = —1 by (2.4) we have 


2m—12m—-1 : 
= II II (1 +p"? + pr), 
j=0 k=0 


If 7 = m we have Te) (2 + op™**) = 2?" — 1, similarly if k = 
m we have []j™) ‘(2 + p™t/) = 2?” — 1, while if j = k we have 
Te (1-27) = 1-2”. Discounting the repetition of factors with 
equal summands, and noting that for 7 = k = m we have the factor 
3, then W,, = —3((2" — 1)/3)?u where 


! : 
v= II (1 + pmti + p™t*) 


(II’ indicates the product for all (j,k) such that 0 < j,k < 2m — 
1,j7#m,k#m, and j #k). 
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The geometric mean of 1, p”™*/, p™** is €** (because p = &3); 
dividing each factor in the above product by the geometric mean of 
the summands, we get 


— De“ Ee sh rr’ ae Cn ys 
The first product is equal to 


II II erg = II EI(2m—2) II ek 


jAm kAém, kAj jAm kAm,kAj 
2m—1 
_ [J -vee® II gk 
j#m j=0 
= [] Giger eer? 
jem 
jem 
2m—-1 
= (Sa) I] pen) 
j=0 


aap GRE orien)? 


a3 (eels — |. 


The second product is equal to [](€° + €/ + €%) (where []” is the 
product for all (e, f,g) such that 0 < e,f,g < 2m-—1,e,f,g are 
distinct and e + f + g = 0 (mod 2m)). Indeed, since 7 4 k, j # 
m,k#m, and 3 /m, letting e, f,g be such that 0 < e, f,g < 2m—1, 
and 


f =m-+2j—k (mod 2m), 


| e = —j—k (mod 2m), 
g=m+2k—j (mod 2m), 


then e, f,g are distinct and e+ f + g =0 (mod 2m). 
Conversely, for every triple (e, f,g) as indicated, let j,k be such 
that 0 < j,k < 2m —1, and 


j = f—e-—m (mod 2m), 
k =g—e-—m (mod 2m), 
sojAm,kAm,andjF#k. 


Let u = [J (€° + €/ + €9) (where []” is the product for all (e, f, g) 
such that O<e< f<g<2m-—lande+f+g=0 (mod 2m)). 
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We show that wu is invariant by every automorphism o of Q(E). 
If o(f) = é', with gced(I,n) = 1,1 <1 <n, then o(€* + & + €%) = 
€¢ 4 €f! + €5! which is a factor of u. If (e’, f’,g’) # (e, f,g) then 
el, fl, gl cannot be congruent modulo 2m to e’l, f’l, g/l, respectively, 
or to these numbers in any order. So different factors of u have 
different images by o, hence o(u) = u, showing that the algebraic 
integer wu is rational, hence u € Z. But v = u® because each factor 
€°+€f +9 of v is equal to a factor €° +€f +€% of u, where (e’, f’, 9’) 
is obtained by permutation of (e, f,g). 

Hence W,, = —3((2” — 1)/3)?u°. 


Before the next result we need to establish a lemma. 


LEMMA 2.2. Let n be such that 2n +1 = p is a prime, let € = 
cos 27/n + /—1sin27/n, and let P be any prime ideal of the cyclo- 
tomic field Q(E) which divides p, t.e., PAZ = Zp. Then there exists 
a primitive root s modulo p such that € = s* (mod P). 


PRooF. Indeed, let g be any primitive root modulo p, i.e., g mod- 
ulo p has order p— 1 = 2n, so g? modulo p has order n, and the 
set of elements modulo p with order n is {g?/ (mod p) | 1 < j < 
n; ged(j,n) = 1}. If1 < 9, k <n, ged(j.n) = ged(kyn).= 1, and 
j #k then g” # g** (mod p). By Chapter II, (3F), ®,(g%7) = 0 
(mod p) for all such exponents j. On the other hand, ©,,(€’) = 0 
so ®,,(€7) = 0 (mod P) for all j, 1 < j < n, ged(j,n) = 1. Hence 
there exists 7 such that € = g” (mod P), and we just take s to be 
s=+g’ (mod p),1<s<p. 


Frame also proved the next statement (while Bang had noted in 
1935 that p | W,,): 


(2H) IJIf2n+1=p is a prime then p\"-/*! divides W,,. 


Proor. To begin, let € = cos2r/n + /—I1sin27/n. Given u,v, 
l<u,uv <n, ged(u,n) = ged(v,n) = 1, for every j, 1 < 7 <n, such 
that gcd(j,n) = 1 let f; = 1— 6% — 9”. 

Let du = [ecarjny=1 £7. We show that d,,, € Z. Indeed, let o be 
any automorphism of Q(€), so o(€) = €' with gcd(l,n) = 1. Then 
o(1— £4 — £9”) = 1 €%" _ €*" where 1 <k <nandk=jl (mod n), 
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ie., o(f;) = fx. Every factor f, of d,,, is so obtained, therefore o 
defines a permutation of the set of factors f; of d,,,. Thus 


o(du.v) = Il o(f;) = II te = Unis 


ged(j,n)=1 ged(j,n)=1 


This proves that d,, € Q and since d,,,, is an algebraic integer, then 
du» € Z. If P is any prime ideal of Q(€) such that PN Z = Zp, if s 
is a primitive root modulo p such that € = s? (mod P) then 


du» = II (1 — 8" — 5”) (mod P), 


gcd(j,n)=1 
SO 
du» = II (1 — 8" — 57/”) (mod p). 
gcd(j.n)=1 
For every h, 2 < h < p—2, and such that h? 4 —1 (mod p) in 
case p = 1 (mod 4), we define ap, bn,0 < an, bn < p—1 by 


{ an = 2h/(h? +1) (mod p), 
by, = —(h? — 1)/(h? +1) (mod p). 


Then a, 4 0,1 (mod p) and b; 4 0,1 (mod p). Hence there exist 
U = Un, UV = Vp Such that 1 <u, v <p—1 and 


a, = s“ (mod p), 
by, = 8” (mod p). 


We have a? +6? = 1 (mod p) so 1—s?"—s*” =0 (mod p), therefore 
p divides dj. 

Ifh, h! with2 < h, h’ < p—2 are such that h? 4 —1 (mod p), h” # 
—1 (mod p) when p = 1 (mod 4), and if h’ = +h or +h (mod p), 
where hh = 1 (mod p) then it is easy to check that ay = tay, 
(mod p) or ay, = +b, (mod p) while b,, = +b, (mod p) or by = 
+a; (mod p), respectively; hence, with obvious notations, 1 — s?“ — 
3?” = 1 — 3 — 3”. Conversely, if h,h’ are such that this equal- 
ity holds then either a, = +a, (mod p), b, = +b, (mod p) or 
an = +b, (mod p), b, = +ay (mod p). Examining all possible 
cases, this leads to h’ = +h or +h (mod p). 

If p #1 (mod 4), the number of possible values for h is p— 3 = 
2n — 2; this yields [(2n — 2)/4] = [(n — 1)/2] factors 1 — 5?" — 5?” 
which are multiples of p. 
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If p = 1 (mod 4), the number of possible values for h is p— 5 = 
2n — 4; this yields [(2n — 4)/4] = [(n — 1)/2] (because n is even) 
factors 1 — s?“ — s?” which are multiples of p. 

Thus, in all cases, pl—))/?! divides J], ,, duyv. Let Wi, = [1], du,vt 
where t € Z (since t is an algebraic integer and t € Q); thus pl("—)/?! 
divides W,,. 


For example, 47!! divides W23 and 1014 divides W5o. 

The next divisibility result concerns Lucas numbers. For the con- 
venience of the reader we shall recall some relevant facts about Fi- 
bonacci and Lucas numbers (see also Chapter V, §3). 

The Fibonacci numbers F,, (n > 0) are defined as follows: 


Fo = 0, F, = 1, 
and for n > 2: 
B= Fan + Fao: 


Similarly, the Lucas numbers L,, (n > 0) are defined as follows: 


and for n > 2: 
Ly = Lied + Dgo3: 
Let a, 3 be the roots of the polynomial X? — X — 1, so 
1+ V5 
2 
_i-vs 
“9 


= 1.6180... , 


p 


= —0.6180..., 


and 
a+P=1, a-B=V5, aBb=-l. 


As is known, a is called the golden number (or golden ratio). 
The following lemma is attributed to Binet (1843): 


LEMMA 2.3. For everyn> 0: 
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PROOF. We have a? =a+1, 6? = 6+4+1,s0 a" =a™!+ a"? and 

also 3” = 3"~! + B"~* (for n > 2). 
Let T,, = (a” — B")/(a— B), so To = 0, T; = 1 and 

a mere 6 as an? — gr? a” — 3” 

i lees thes. = — 7 

1+ 2 58 + ee 60 

(for n > 2). Thus, the sequence {T;, | n > 0} coincides with the 

Fibonacci sequence. Similarly, let U, = a" + 6", so Ug = 2, U, = 

1 and U,,-1+Un_2 =q” 14 gn Liq 24g Dr a + Br = Ux (for 

n > 2). Thus, the sequence {U,, | n > 0} coincides with the Lucas 

sequence. 


For more results about Fibonacci and Lucas numbers, see, for ex- 
ample, the books by Vorob’ev (1961), Hoggatt (1969) or Ribenboim 
(1995). 

Frame proved: 


(21) If n is odd then L? divides W,,. 


PROOF. By (2C) we may assume that 3 /n. We have 


n-1 


[a -¢ -&) = -[]a-ea\1 -&8) 


: = —(1-a")(1— 6") 
= 14+ (a" +6") —(-1)" 
eit 


since n is odd, and using the preceding lemma. Similarly, im eee (1- 
€2k _¢k) — [,. Now we note that if 1 < j, k < n—1, then the pairs (j 
mod n, 27 mod n) and (2k mod n,k mod n) are distinct. Indeed, 
otherwise 7 = 2k (mod n) and 27 = k (mod n), hence 37 = 3k 
(mod n) and j = —k (mod n). Therefore 7 = k and j = n—k, so 
n = 2k is even, contrary to the hypothesis. 

This shows that W,, = L?v where v € Q, and v is an algebraic 
integer, so v € Z. 


For example, the squares of the Lucas numbers 


Laz = 6 643 838 879 
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and 
Ds3 = 119218851 371 


(which are known to be primes), divide W147, Ws3, respectively. 

Improving on a result of Lubelski (1935) and independently of 
Bang (1935), E. Lehmer indicated in 1935 the following divisibility 
property of the Wendt determinant: 


(2J) If p is an odd prime then p?~?(2?-1 — 1)/p divides W,_1. 


PROOF. Consider the matrix C’ whose determinant is W,_1: 
meee ney eats) 
1 2 p-2 
Cr!) (23) 
1 p—3 
SY (aa) Goa) tm Go) 
p—3 p—2 p48 


(7) (2) = 


Adding every column of C' to its last column, we obtain a matrix C’ 
whose last column has all its elements equal to 


pd eel pas -1 
1 2a = 9-1 = 4, 


Adding to each column of C’ (up to the column p — 3) the next 
column, we obtain a matrix C” such that the elements of the first 
p—3 columns are of the form 


(a) +(e) = (eta) 


for k = 0,1,...,p— 2. These elements are all multiples of p. Thus 
W,-1 = det C” is a multiple of p?—3(2?-' —1) = pP-?(2P-1 — 1) /p. 


For a recent related result, see Helou (1997). 
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Frame observed experimentally that if n < 50 and 6 //n then 


llogy) |W,,| — n? log, | < 0.33, 


where 


9 n/3 
loge = - | log(2 cos 6)dé, 
7 JO 


so log;yc = 0.140305. Thus W,, has about 0.1403 n? digits. The 
knowledge of the size of W,, and of some prime factors of W,, may 
guarantee that the factorization of W,, is already complete. 

In 1982, Boyd showed that the sequence {A,,}, with 


An = logi, |W,.| — n? logig ¢, 


is bounded and has exactly three limit points 0, $1log,,3, logy, 2, 


1 


corresponding, respectively, ton = 


o] 


2 or 3 (mod 6). 


The values of W,, (for n = 2k > 20) in the table below have been 


kindly provided to me by J.S. Frame. 
Wendt determinant: 

W, = 1, 

ae. 

= 28=2?x7, 

= —375 =—3 x 53, 

= 3751 = 11? x 31, 

= 0, 

= Due 2OP DT: 

= —37 x 53 x 173, 

= 07 eS Rr Se 7S, 

creo aa El aba a 

= 23° x 67? x 89 x 1997, 

= 0 


= 
T 


ele as a ae ae ae a ce 


ao uo ee Ww nN re Oo 
| 


= Be RTE RAT OS, 


ana ny oOo 
Howl We | 


SS SSS ZS SSSEZESREZEEF 


iw) 


x 449° x 65537°, 


= 23 eb Le Ble AT ee OL 

= —3 x 2371 x 67° x 89% x 199® x 6833, 

= 207 9¢ 55° 708 IST! 521° & 273 1° K 8191", 

= 209) 53-518" e207 dB 18 107? 197°, 
= 2B sc 5757" RIT KATE & OT C108? & 957° 353° 


GR Oa 9? @ Ble B21 B19, 


= O47 1? e381 <6? e151 OTT, 
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W34 = —3 x 103!? x 13712 x 239% x 307° x 409% x 613° x 3571° 
x 436913 x 1310718, 

W3g = —3 x 7° x 191)? x 22912 x 419° x 457° x 647° x 761° 
x 1483° x 9349° x 174763° x 5242873, 

Wig 8"? & Bt 11 Ble ALP? DAT? O81 oe O41 881° 
x 10485773, 

Was = —3 x 5° x 2318 x 67% x 897! x 199° x 3979 x 617° x 1013° 
DIS? 2333 se 8257 B25 T° % 485 7 15 109", 

Wag = —3 x 47*% x 1391? x 4618 x 599!? x 691! x 829% x 11516 
x 2347° x 4357° x 178481° x 27962033, 

Wso = —3 x 118 x 10174 x 15118 x 251° x 401° x 601° x 1151! 
x 1301° x 1601° x 1951° x 3851° x 4651® x 5801° x 6101° 
x 445613 x 558313, 

Weg = +37? x BF x 53°) x 79% x 1318 x 157 % 233° « 313 
x 521° x 677° x 1301° x 1613° x 2731° x 6709® x 8191° 
x 13417° x 20593°. 


In 1991, Fee and Granville computed the factors of (the modified) 
W,, for every even n < 200 (including when 6 divides n). From these 
calculations, it followed: 


(2K) Ifk < 100, ifp and 2kp +1 are primes, then the first case 
of Fermat’s last theorem is true for the exponent p. 


PRooF. The result follows from the explicit determination of the 
factors of W., (for 2k < 200) and the verification that the conditions 
of (2B) are satisfied. 


From the values of W2, (for 2k = 2,4,8,16) it follows at once 
that Fermat’s congruence X? + Y? + Z? = 0 (mod q), where p and 
q = 2kp + 1 are odd primes, has only the trivial solution. However 
X°?+Y°%+ Z? =0 (mod 31) and X*?+Y* + Z? =0 (mod 43) have 
nontrivial solution, because 31 | Wi and 43 | Wy4. 

According to a result of Dickson (1909) (see Chapter X, (2C)), if 
p,q are primes and gq > (p—1)?(p— 2)? + 6p — 2 then the congruence 
(2.1) has a nontrivial solution. Therefore, if g = 2kp + 1 and 6 2k 
then by (2B) q | Wag. 

We conclude this section by referring to statements made by Gan- 
dhi (1975, 1976) if the first case of Fermat’s theorem is false for 
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p. These conditions are expressed in terms of appropriate circulant 
determinants, however the proofs were never published, due to the 
untimely death of Gandhi. 


1843 


1853 


1871 


1894 


1895 


1909 


1909 


1910 


1920 


1935 


1935 


1935 


1961 


1968 
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IV.3. Appendix: Sophie Germain’s Primes 


We have mentioned that it is a very difficult problem to know whether 
there exist infinitely many primes p such that 2p + 1 (or 4p + 1, or 


8p +1, 


etc ...) is also a prime. A heuristic argument points to the 


validity of a much more general statement, as we shall explain. 


If x 


is any positive number, we denote by a(x) the number of 


primes p such that p < x. The famous prime number theorem of 
Hadamard and de la Vallée Poussin (1899) states that 


7(x) 


im 
zoo x/logx 


140 IV. Germain’s Theorem 


which we write also as m(x) ~ x/log a. 

Dirichlet’s theorem states that if a,m are relatively prime (pos- 
itive) integers then there exist infinitely many primes in the arith- 
metic progression {a+ km | k > 0}. If ta:m(x) denotes the number 
of primes p in this arithmetic progression and such that p < x, then 


that is, 

1 x 
v(m) loga’ 

Considering the polynomial f(X) = mX + a, Tam(x) represents 
the number of integers n, 1 < n < (a—a)/m such that f(n) is a 
prime. 

More generally, we may consider the following situation. Let 
fi(X), fo(X), ..., fe(X) be polynomials with integral coefficients, 
and positive leading coefficient. Let d; > 1 be the degree of f;(X). 
Assume moreover that these polynomials are irreducible over Q and 
that none is a constant multiple of another. Let N be any positive 
integer and let Q(.N) = Qy,.....,,(N) denote the number of integers 
n,1<n<N, such that fi(n), fo(n),...,f(n) are primes. 

The probability that a large positive integer m be a prime is 
m(m)/m ~ 1/logm, by virtue of the prime number theorem. 

Since we shall be interested in the values of the polynomials f;(X), 
fo(X),... , f¢(X), we have to discount the fact that k-tuples of such 
values are not randomly distributed. 

If p is an arbitrary prime, let s, denote the chance that none of 
the integers of a random k-tuple be divisible by p. Then 


—1\* 1\* 
oY (4) 
Pp Pp 
Similarly, let r, denote the chance that for a random integer n, none 


of the integers fi(n), fo(n),... ,fg(m) be divisible by p. If w(p) de- 
notes the number of solutions of the congruence 


fi(X) fo(X) +++ fe(X) =O (mod p), 


then r, = (p — w(p))/p = 1— w(p)/p. It may be shown that the 
product [],, rp/sp is convergent, say to a limit C= C(fi,... , fx). If 
we agree that this number measures the extent to which the values 


Ta,m (x) ce 
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of f,(X),... , fx(X) form a nonrandom k-tuple, then the probability 
that fi(n), fo(n),...,f(m) are all primes (for large n) is equal to 


1 


log fi(n) ++ - log fx(m) 
1 1 
AS G( Fisses TE, e (log n)F’ 


GU fissas ein) 


since log f;(n) ~ d;logn. Then 


1 soil 
QW) ~ Cin ga 


kr < (log n)* ; 


In particular limy+.Q@(N) = co. Thus, we see in a heuristic way 
that there should exist an infinite number of primes p with the re- 
quired properties. 
Now we consider the following special cases: 
(1) fi(X) =X, fo(X) = 2X+; and 
(2) fi(X) =X, fo(X) = X +2. 
Case (1) deals with Sophie Germain’s primes while case (2) refers 
to twin primes. In both cases w(2) = 1 and w(p) = 2 if p > 2. Hence 
the constant is 


2 
i a" 

C= 2~]] P , = 13203236, 
(1-5) p>2 (1-< 


SO 


N 

N) ~ 1.3203236 SET 

This expression had been conjectured by Hardy and Littlewood 

in 1923, for the count of twin primes less than N. It agrees rather 

closely with the actual number of twin primes; see Sexton (1954), 
Wrench (1961), Shanks (1962), and Brent (1975). 
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V 
Interludes 5 and 6 


In this chapter, we give more background material. 


V.1. p-Adic Numbers 


A. The Field of p-Adic Numbers. In order to study divisibil- 
ity properties by a prime p, it is often convenient to consider the 
development of integers in the base p: 


a= a9 + ap +--+ + amp™, 


with 0 <a;<p—1, p™<a<p™". 

Hensel also introduced infinite p-adic developments. The new 
numbers so defined are the p-adic integers. He described the op- 
erations of addition and multiplication among the p-adic integers 
and proved a very important theorem concerning the existence of 
p-adic integers which are roots of certain polynomials. 

The p-adic numbers may be considered as being limits of sequences 
of integers, relative to the p-adic distance. These considerations al- 
lowed the introduction of methods of analysis in the study of divisi- 
bility properties. 
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We shall describe here very briefly the concepts of p-adic numbers 
and the results which we shall require to discuss Fermat’s equation. 

In Chapter II, §1, we have defined, for every prime p, the p-adic 
valuation v, of Q. We recall that the set of p-integral rational num- 
bers is 


Zp = {5 | a,b Z,b40, pb, god(a,8) = 1) U {0} 


It is a subring of Q containing Z. Moreover, if r € Q then v,(r) > 0 
if and only if r € Zp. 

Z, is called the valuation ring of v,. The only maximal ideal of 
Z, is equal to Z,p = {a/b € Z, | p divides a}. The field Z,/Z,p is 
isomorphic with the field F,, (with p elements); it is called the residue 
field of vp. 

The valuation v, defines, on the field Q, the function d, given as 
follows. If a,b € Q, then 


d,(a,b) = p-*?(2-§) when a 4 b, 
d,(a,a) = 0. 
It is easy to verify the following properties: 
d,(a,b) > 
d,(a,b) = ie a), 
d,(a,b) = d,(a— 6,0), 
d,(a+c,b+c) = d,(a,b), 
d,(a,b) < max{d,(a,c), dp(b, ¢)}, 
dp(a, c) + dy(b ) C)s 
So d, is a distance, compatible with the operation of addition. It is 
called the p-adic distance of Q. 
The completion of Q relative to the p-adic distance is again a field, 
denoted Q, and called the field of p-adic numbers. The nonzero 
elements a of Q, are represented by p-adic developments 


oe) 

4 

a= 5 aip , 
i=™M 


with 0 < a, < p—1,m € Z, anda, #4 0. If an = YL, ip" 
(with n > m) then a = lim, a, (the limit is relative to the p-adic 
distance). 


< 
< 
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The p-adic valuation may be extended by continuity to a valuation 
0, of the field Q,, which is defined by 


Op (> ow) Soe (OF ag. 0), 


Thus, the values of 6, are also integers or infinity. 

The topological closure of Z, in the field Q, is a ring, denoted by 
Deis Its elements are called the p-adic integers. Thus a € Q, is a p- 
adic integer exactly when 0,(a@) > 0. It is also clear that ZpJQ = Zig: 
The only nonzero prime ideal of Zs is LP, consisting of the multiples 
of p. The residue field of 0, is Zin /LpP, which is isomorphic to the 
field F,,. 

If a,f€ Q, we say that a divides G if there exists y € Te such 
that ay = (; this means that 6,(a) < 6,(3). a € Z, is a unit in 
Z,» when a divides 1, i.e., 6,(a) = 0. The set U, of units of Z, is a 
multiplicative group. 

If a, 8,7 € Q, 7 £9, we write a = GP (mod 4) if y divides a — f. 
Similarly, if y € Q,, 7 # 0 and F(X), G(X) € Q,[X] we write 
F(X) = G(X) (mod y) when y divides each coefficient of F(X) — 
G(X). 

These congruence relations satisfy the usual properties of congru- 
ences of integers. 


B. Polynomials with p-Adic Coefficients. We discuss briefly 
polynomials with coefficients in the field Q). 

If f(X) = aX" + a,X" "+--+ +a, € Q,[X], we make the 
definition 6,(f) = mino<i<n{v,p(a;)}. If f,g € Q,[X], with g 4 0, we 
define 

a ei ae 2 

i (2) = a(8) - alo), 
which is well defined. Then @, is a valuation of the field Q,(X), 
whose restriction to Q, is the valuation v,. For simplicity, we shall 
write v,, instead of Up. 

If fig € Q,[X], we write f = g (mod p”) when v,(f — g) > n, 
or equivalently, p” divides each coefficient of f — g. For every f = 
ym, a;X' € Z,[X], we denote by f = f mod p the polynomial 
yn aX! € F,[X]. 


146 _‘*V. Interludes 5 and 6 


We recall now some well-known facts about polynomials in Q,,[X]. 
The polynomial f € 73) is said to be primitive when v,(f) = 0. 
Every polynomial f € Z,[X] may be written as f = af, where 
aéZ, fie Zy[X| and f; is primitive. 


(1A) Gauss’ Lemma. /f f,g € ZLy|X] are primitive polynomials, 
then f -g is also a primitive polynomial. 


(1B) [ff € Z,[X] is primitive and f = g-h with g,h € Q,[X], 
then f = fi-g1 for some primitive polynomials f,,9; € Z,[X], such 
that deg g, = deg g, deg h, = deg h. 


The nonconstant polynomial f € Z,[X] (respectively, f € Q,[X]) 
is irreducible in Z,[X] (respectively, in Q,[X]) if it is impossible to 
write f = g-h, with g,h nonconstant polynomials in Z,[X] (respec- 
tively, Q,[X]). 


(1C) Iffe 71x, then f is irreducible in Zyl X| af and only if it 
is irreducible in Q,|X]. 


The nonconstant polynomials f,g € Zp X ] are said to be relatively 
prime whenever, if h € Z,|X] and h divides f and g, then deg(h) = 0. 


(1D) Jf f € Z,[X] is nonconstant and primitive, and if f does 
not divide the nonconstant polynomial g € Z,|X], then f and g are 
relatively prime. 


(1E) Jf f,g € Z,[X] are nonconstant and relatively prime, then 
there exist polynomials s,t € Z,[X] such that s- f+t-g is a nonzero 
element of Z,. 


(1F) Zffighe LyX], if f is irreducible and if f divides g-h, 
then either f divides g or f divides h. 


(1G) TIfg,he Zp|X] are nonconstant and relatively prime, if g or 
h is primitive and both g and h divide f, then g-h divides f. 


V.1. p-Adic Numbers 147 


(1H) Every nonzero polynomial f € Tosi els may be written as 
a product f = ag,--:Gm, where a € De; and g1,---59n € Zy[X| 
are primitive irreducible polynomials with n > 0. Moreover, a and 
Jis+++5Gn are uniquely defined up to a unit in Z,. 


Now we shall consider the resultant and the discriminant of poly- 
nomials in Z,[X]. 


(11) Let f,g be nonconstant polynomials in LyX]. Then the 
following conditions are equivalent: 
(a) There exists a nonconstant polynomial h € Zy[X] which di- 
vides both f and g. ; 
(b) There exist nonzero polynomials fi,g, € Zp [|X] such that 


deg( fi) < deg(f), deg(gi) < deg(g), and 


n:ft+fieg =O. 


PROOF. (a) = (b) We assume that f = h- fi and g = —h- qi 
with fi,9: € Z,[X]; then deg(f:) < deg(f), deg(g:) < deg(g), and 
an: f+heg=0. 

(b) = (a) Conversely, we assume that there exist polynomials 
fi and g € ZAK, such that deg(f,) < deg(f), deg(g:) < deg(g), 
and gf + fig = 0. If f,g are relatively prime, then by (1E) there 
would exist polynomials s,t € Z 1x such that s-f+t-g=ceé 
Zi, with c #4 0. Eliminating g from the above relations, we obtain 
f(sfi — tg1) = cf; where deg(f;) < deg(f), which is impossible. 


(13) In order that f = =p a,X™~* and g = YY 6; X" 4 (where 
m,n > 0, and f,g € Zp|X]) have a common nonconstant factor, it 
is necessary and sufficient that R(f,g) =0. 


PROOF. It was seen in Chapter II, (4B), that if f,g have a common 
nonconstant factor, then R(f,g) = 0. 

Conversely, by (11) it is equivalent to show the existence of nonzero 
polynomials fi: € Z,[X],g. € Z,[X], fi = Dio GX", w= 
yd: X"-1-*, such that gi - f + fig =0 (it is not excluded that 
co = do = 0). This relation is equivalent to the following system of 
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m-+n equations in the unknown quantities cg,... , Gm—1,do,--- ;dm_1 
(obtained by equating to zero the coefficients of the powers of X): 


dodo + Codo = 0, 
doa, + diag + Cody + C19 — 0, 
doar + da, + dA + Code + Cb, + Cobo = 0, 


This homogeneous linear system has a nontrivial solution in Q, if 
and only if the determinant of its matrix vanishes, or equivalently, 
the determinant of the matrix obtained after exchanging rows and 
columns vanishes; in other words, R(f,g) = 0. Now we finish the 
proof by noticing that if there exists a nontrivial solution in Q,, 
by multiplying by the common denominator of these elements, we 
obtain a nontrivial solution in Lp: 


(1K) Let f,g € Z,[X] be relatively prime nonconstant polynomials, 
such that v,(R(f,g)) =p. Then every nonzero polynomial h € Z,[X] 
such that v,(h) > p and deg(h) < deg(f)+deg(g) may be written in a 
unique way ash = g,-f+fi-g, where fi, 91 € Di Kl, Up( fi) > Up(h)— 
P; Up(gi) 2 Up(h) — p, deg(fi) < deg(f), and deg(gi) < deg(g). 


PROOF. Let 
if = pare Gus 
i=0 


2 eae 
i=0 


g = 
m+n—1 
h= So exmen 
1=0 


We want to determine f, = ar CO = ay d,X"-'* in 
Z,|X] such that h = g,- f+/fi-g. Comparing the coefficients of X in 
both sides of the above relation, we obtain a linear system of m+n 
equations in the m+n unknown quantities c;,d;, whose determinant 
is exactly R(f,g). 

Since f,g are relatively prime, by (1J) we have R(f,g) 4 0; hence 
the above system has a unique solution. 
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The coefficients c;,d; may be computed by Cramer’s rule; their 
numerators are linear forms in the e; with coefficients in Lp (because 
fig € TeX ]), and their denominators are equal to R(f,g). From 
Up(e€;) > Up(h) > 0 = v,(R(f,g)), it follows that v,(c;) > 0, vp(d;) > 


0, so that fi,g, € Z,[X] and v,(f1) > up(h) — p, Vp(gi) > vp(h) — 
p. 


(1L) = Let g € Z,[X] be a nonconstant polynomial. In order that 
there exist a nonconstant polynomial g € Z,[X]| such that g? divides 
f, it is necessary and sufficient that Discr(f) = 0. 


ProorF. In fact, if g? divides f, then g divides f and f’, hence 
Discr(f) = R(f, f’) = 0. Conversely, if Discr(f) = 0, by (1J) there 
exists a nonconstant polynomial f € Zy[X| dividing f and f’; by 
(1H), we may assume that g is irreducible. We have f = g-h, hence 
f' =g'-h+q-h’; since g divides f’, it follows that g divides g’-h; from 
deg(g') < deg(g) we see that g does not divide g’, hence g divides h 
(by (1F)), and so g? divides f. 


We shall now investigate the behavior of the resultant R(f,g) 
when f,g are replaced by sufficiently close polynomials, relative to 
the metric defined by the valuation v, on Q,(X). 


(1M) If fig, fisg. € Zy|X] are nonconstant polynomials and 
up(fi — f) 2 a, (gi. — g) = B, then v,(R(fi,9) — R(f,g)) = 
min{a, 3}. 


Proor. Let a € Z, be such that v,(a) = min{v,(fi — f), v»(g1 — 9) }; 
then fi = f + ah and gi = g+ak, where h,g € Z,[X]. Thus 
R(fi, 91) = R(ft+ah, g+ak). Writing the eliminating matrix between 
ftah, g+ak, and computing the determinant, we obtain R(f,g)+as, 
where s € Ly is a certain sum of products of elements equal to a or 
to coefficients of f,h,g,k. Thus 


vp R(fi, 91) — REF, g)) 2 ep(a) 2 min{a, FF. 


(IN) With the above notations, if f, fi € Z)[X] are nonconstant 
polynomials and v,(f — fi) > a, then v,(Discr(f) — Diser(f,)) > a. 
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ProoF. In fact, since Discr(f) = R(f, f’), Diser(f:) = R(fi, fi’), 
we have only to remark that v,(fi: — f) > @ implies v,(f,1' — J )>a. 
Indeed, if m = max{deg(f),deg(fi)} and f =  ai:xX”"", fi = 
ee DL X™*, then 


m-1 


m—1 
f' = S- Gene Fi = So (m _ Hix 
i=0 


1=0 
thus v,((m—1)(b;—a;)) > vp(b;—a;) for every i = 0,... ,m—1, and so 
Up(fi' —f’) > min{vp(b;—a;) : i =0,...,m—-1} > v,(fi-f) > a. 


We also have at once: 


(10) If fig € LyX] and f = f modp,g mod p, then the 
resultant of f and g (computed in F,[X]) is R(f,g) = R(f,g) and 
the discriminant of f (computed in F,[X]) is Discr(f) = Discr(f). 


We say that the monic nonconstant polynomials f,g € Lp [X] are 
relatively prime modulo p when f,9 are relatively prime polynomials 
in F,[X]. Similarly, f is said to be irreducible modulo p whenever f 
is an irreducible polynomial of F,[X]. Every polynomial f € Lp [X] 
is congruent modulo p to a product of polynomials in Ll X ], which 
are irreducible modulo p, and are uniquely defined modulo p. 

With these definitions, we have: 


QP) figeE Zy[X| are relatively prime modulo p if and only if the 
resultant is a unit in Zp, t.e., if and only if v,(R(f,g)) = 0. 


ProoF. By definition, f,g are relatively prime modulo p when f,g 
are relatively prime polynomials in F,[X]; by Chapter II, (4B), this 
means that R(f,g) 4 0; by (10), this is equivalent to R(f,g) 4 0, 
that is, p does not divide R(f,g), or equivalently, v,(R(f, g)) = 0. 


(1Q) Let f,g € Z,[X] be irreducible polynomials modulo p. Then 
p divides R(f,g) if and only if f =g (mod p). 


ProoF. In fact, p divides R(f,g) exactly when f,g are not relatively 
prime modulo p; hence there exists a nonconstant polynomial h € 
Z,|X]| such that f = h-f; (mod p), g =h-g: (mod p); by hypothesis, 
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we must have f = h (mod p), g=h (mod p), hence f = g (mod p). 
The converse is trivial. 


We observe that if f = S77", € Z,[X] and f’ is a multiple of p, 
then f = )Ojs9 @piX”’ (mod p). Indeed, the coefficients of f’ are 
ja;; thus if p divides f’, then p divides a; when p does not divide 7. 
Therefore, 


f= (= ona" (mod p). 


i>0 


In particular, if f is irreducible modulo p, then f’ 4 0 and hence we 
may consider the discriminant of f modulo p. 


(AR) iffe Zyl X] is irreducible modulo p, then p does not divide 
Discr(f). 


PROOF. We write f = f;+pf2 where all the coefficients of f; are not 
multiples of p. Then f’ = f;'+pf.’ and by (1N), Discr(f) = Discr(f;) 
mod p. If p divides Discr(f), then p divides Discr(f,) = R(f:, fi’). 
By (1P), there exists h € Z,[X] such that h is nonconstant and h 
is a common factor of Pichi Thus f,; = h-gZ, rik = h-k with 
g,k € Z,[X]. Since f, = f is irreducible, then g = with c € Z,. So 
Cc: rn = f,-k. Therefore 


deg(f,') < deg(f:') = deg(F,) < deg(F, - k) = deg(F,’), 


which is absurd. 


(1S) Let f € Z,[X] be such that f is not constant. Then f has a 
multiple irreducible factor modulo p if and only if p divides Discr(f). 


PROOF. We have f = g192°-:gn (mod p), where gi, 92,--- ,Gn are 
irreducible modulo p. Hence, by (1M) and Chapter II, (4D), 


Discr(f) = Diser(gige--- gn) 
ee I Diser(g:) - [][R(gi.9;)]’ (nod p). 


By (1R), p does not divide Discr(g;) for 1 <i <n. Then p divides 
Discr(f) if and only if there exist indices i < 7 such that p divides 
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R(gi,9;). By (1Q) this means that g; = g; (mod p), concluding the 
proof. 


C. Hensel’s Lemma. This very important result, proved by Hensel 
in 1908, is certainly the raison d’étre of the p-adic numbers. It as- 
serts the existence, under appropriate conditions, of p-adic roots of 
polynomials. We shall prove here Hensel’s lemma in its strong form: 


(1T) Let F,g,h € Z,[X] be such that: 

(i) deg(g) = m > 0, deg(h) = n > 0, deg(F) = m+n, g is 

monic and deg(F — gh) < deg(F); 

(ii) v,(R(g, h)) = p => 0; and 

(iii) v,(F — gh) =a > 2p. 
Then there exist G,H € Z,[X] such that v,(G—g) > a—p, v,(H — 
h) > a— p, deg(G) = deg(g), deg(H) = deg(h), G is monic, H,h 
have the same leading coefficient, and finally, F =G-H. 


PROOF. We shall prove the following assertion, for 7 > 0: 

(*) Ifg,he Zyl X], deg(g) = m, deg(h) = n, g is monic, deg(F’— 
gh) < deg(F), vp(F — gh) > a+ j, and v,(R(g, h)) = p, then 
there exist polynomials g*, h* € Zp|X |, such that deg(g*) < 
m, deg(h*) <n, vp(g*) > a+ j — p, Up(h*) > a+ j—p, and 
Up (F—(g+g*)(h—h*)) >atj-1. 

Indeed, since v,(R(g,h)) = p then R(g,h) £0. By (1J), g and h 
are relatively prime. 

We note that v,(F — gh) > a+ jap and deg(F' — gh) < deg(F’) = 
deg(g) + deg(h), and it follows from (1K) that there exist uniquely 
defined polynomials g*,h* € Z,[X], such that the following hold: 


F —gh=h*g+g*h, 

Og) 2 Ug = Oh) =p 2 AF ip; 
Uh") 2 OF = gh) —poats—p, 
deg(g*) < deg(g), 
deg(h*) < deg(h). 
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Therefore 
wl(F—(g+g*)\(A+h*)) = up((F — gh) — (h*g + g*h) — g*h*) 
= U,p(g*h*) 
> 2(a+j-p) 
= (a — 2p) + (a + 23) 
>SPatjt+l. 


We apply this result, beginning with g = go, h = ho, j > 0, and 
letting 91 = go + gj, hi = ho + hj. Then we apply the result for 
01,1,j9 =1, obtaining go = 9g, + gj, he = hi + hj and so on. 

We have to note that 


Up(R(gj41, hj+1)) = vp(R(g;,h;)) = 


for every j > 0, because 


Up [R(gj415 Aji) — R(gj,hy)] 2 min{y,(g"), vp(A")} 
amen Fas J — p> Pp, 


as follows from (1M). 

Thus, we have the sequences of polynomials (g;);>0 and (h;)j>0 
such that deg(g;) = m, deg(h;) =n, each g; is monic, h; and h have 
the same leading coefficient, and finally 


Up(Gj41 — 93) 2 A+I-/P, 
Up(hj41 — hj) 2 at jp. 


Thus (g;)j;>0 and (h;);>9 are Cauchy sequences of polynomials of 
degree m,n, respectively. This means that if 


the sequences (b;;);>0, (Cij)j>0 (for every i), are Cauchy sequences 
in Q,. Since Q, is complete, let b; = limb;;, c; = lime,;, and G = 
Diao OX", H = ig ci X". 
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Then v,(G — g;) > a+ j — p, because 


G-gj)= dim (> via) 
i=0 

and 

Up (S41) Pinky =p, 

i=0 
for every s > 1. Similarly, v,(H —h,;) > a+ -—p for every j > 0. 
Finally, 
vp(F — GH) = v,[(F — gjhj) + (95 — G)H + g(hj — #)| 
min{vp(F' — 9jh;), Up(g; — G) + vp(H), 
Up(g3) + Up(hy — H)} 

= a+ J — p; 


V 


for every 7 > 0. It follows that F = GH with G monic, and H,h 
having the same leading coefficient. 


Now we give Hensel’s lemma in its more customary form: 


(1U) Let F,g,h € Z,[X] be such that: 

(i) deg(g) = m > 0, deg(h) = n > 0, deg(F) = m+n, g is 

monic and deg(F — gh) < deg(F); 

(ii) g,h are relatively prime modulo p; and 

(iii) F = g-h (mod p). 
Then there exist polynomials G,H € Z,|X]| such that G = g (mod p), 
H = h (mod p), deg(G) = deg(g), deg(H) = deg(h), G is monic, 
H, h have the same leading coefficient, and F =G.- H. 


PROOF. This is an immediate corollary of the preceding result. In- 
deed, by (1P), v,(R(g,h)) = 0. Since v,(F — gh) > 1, the above 
result may be applied. 0 


Another commonly encountered form of Hensel’s lemma concerns 
the lifting of roots modulo p. 


(1V) Let F €Z,[X] with deg(F) > 1, let a € Z, be a simple root 
of the congruence F(X) =0 (mod p). Then there exists b € Z, such 
that b= a and F(b) =0. 
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PRooF. By hypothesis, F = (X — a)h (mod p) where h(A) # 0 
(mod p). So X —a,h are relatively prime modulo p. By (1U), 


F = GH, with G monic, deg(G) = 1, G = X — a, thus G= X —b 
with b € Z,, b= 4G, and therefore F'(b) = 0. 


We shall apply Hensel’s lemma to the polynomial X?~! — 1: 


(1W) [fp is a prime then Zs contains p—1 (p—1)th roots of 
1. More precisely, for every j = 1,2,...,p—1 there exists a unique 
element w; € Z, such that we =1 and w; =j (mod p). 


PRooF. For every j = 1,...,p—1, j?-' =1 (mod p), so X?"!-1= 
Meee, 3 — j) (mod p). Thus 1,2,...,p— 1 are all the roots of the 
congruence, and they are simple. By (1V), for every j there exists 
w; € Z, such that wet = 1 and w; = j (mod p). 

For the uniqueness, we observe that if w € Dies we t=1,andw= 
k (mod p), then w must coincide with one of the roots of X?~' — 1, 
say W = w,;; then j =w; =w =k (mod p), soj =k, ie., w = wy. 


Let p be a prime and let (Z/p)* denote the multiplicative group 
of nonzero residue classes modulo p. Let 2 denote the multiplicative 
group of (p — 1)th roots of unity in Z,. 

As a corollary, we have: 


(1X) The mapping which associates to each nonzero residue 
class 7 modulo p the (p — 1)th root of unity w; in Des such that 
w; =Jj (mod p), establishes an isomorphism between the multiplica- 
tive groups (Z/p)* and Q. Moreover, w, is a generator of Q if and 
only if g is a primitive root modulo p. 


ProoF. Indeed, if 1 < 7,k,h < p—1 and jk =h (mod p), by (1W) 
it follows that wjw, = wp, (mod p). Since w; = j (mod p), the map- 
ping 7 modulo p + w, is an isomorphism. The last assertion is 
trivial. 


(1Y) With the above notations: 
(1) iF P—-—1/Jr then gw" = 0; and 
Qipa tl |e then 3 @" =p= 1, 
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ProorFr. (1) Let g bea primitive root modulo p, so w, is a generator 
of the multiplicative group 2. Then 


p-2 J —we-dr 
way eS g ="); 
g 1l—w" 
wEQ j=0 g 


when p—1/r. 
(2) Ifp—1|r then w = 1 for every w € 2, hence gw” = 
p-l. 
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V.2. Linear Recurring Sequences of Second Order 


Let A, B be nonzero integers such that D = A?-4B #0. Let Ro, R, 
be given integers and for every k > 2 let Ry = ARz_1 — BRy_2. 

If A = 1, B = —1 and Ry = 0, Ri = 1, then Ry, is the kth 
Fibonacci number. If A = 1, B = —l and Ry = 2, R; = 1, then 
R, is the kth Lucas number. These sequences of numbers have been 
briefly considered in Chapter IV, 82. 

We shall now indicate several elementary properties of the se- 
quence of numbers (R;,),>0- For our purpose, we shall assume that 
Ry =O and R, = 1. There are analogous results when Rp = 2, Ry = 
A. 

Part (1) of the following lemma was given by Siebeck (1846); it 
is a generalization of Binet’s result for Fibonacci numbers (Chapter 
IV, Lemma 2.3). In 1878, Lucas published a classical paper on this 
subject; see also Lehmer (1930). For further references on recurring 
sequences, see Dickson (1920, Vol. I, pp. 393-411). 
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LEMMA 2.1. With above notations and hypotheses: 


(1) Ifa, are the roots of the equation X* —- AX + B=0, that 
1s, 
A+vV/A?2—4B A-~V/WA?—4B 
ar 


2 p = D ’ 


then for every k > 0: 
ak — BF 
a—B 
(2) If k, h > 1 then Rysh = Ry Rrsi = BR,_1Rzp,. 
(3) (a + B")? = DR? + 4B". 
(5) AR, + (a" + 6") = 2Rnasi for h > 0, 
AR, — (a + B") =2BR,_, forh>1. 
(6) Ifk > 1 ts odd then 


(BD/2/ 
9k-1Rp a ae a Oe 
= & a i 


h=0 


R, = 


ety 
Qk-lgk 4 BR) = 3 ( Jaros 


h=0 


(7) Ifk >1 is odd and n> 1 then 


(k—-1)/2 
kfk-h-1 
Rn — Ptk-)/2 pk a 

f nt », Ate. (ee 


Beh plk-2h-))/2 pk-2h | 
PRooF. (1) From a? — Aa+ = 0 it follows that a*t? = Aa*t! — 
Bo* and similarly B*+? = AG*t! — BBY. Noting that a 4 (3 since 
A? — 4B #0, by subtraction and division by a — 3, we obtain 


gk+2 — gkt2 7 qk+l _ gktl ak — Bk 


a—p a— a—p 
Let Rj, = (a* — B*)/(a — 8) for every k > 0, so Ri. = AR, — 
BRi, for k > 0. Since the sequences (Rz)x>0, (Ri)r>0 Satisfy the 


same recurrence relation and Rj = Ro = 0, Ri, = Ri = 1, then 
R, = Ri, = (a* — B*)/(a—k) for every k > 0. 
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(2) 
RyRazi — BReiRn 
7 (a® — BF)(att! — B+) — aB(ak-! — B*-1)(a — BP) 
- (a — B)? 


kth _ ghth 


ee ee ee Rysn- 


a—p 


(a + 8")? — DR? = a?" + 2a%B" + Br p(<=*) 


azn + 2a!" fe ge” rat (a? = 2a" 8 ae BS) 
dah = 4B", 
since (a — 3)? = D, aB = B. 
(4) Leth =nk. Ifn =1 it is trivial and we proceed by induction: 


Rentiyk = RapReyt —* BRyp-i Ry 


is a multiple of Rx. 
(5) We have A=a+{Q, so by (1), 


AR, + (a + 6" 


_ aaee hi ah 
= (a+ B)— 3 + (a" + 2") 
- qr ee Caer ate eee are eer 
= a B 
— Qh+l 
= ener tas - ; — 2Rndi- 
Similarly, 
AR), — (a” + 6) 
_ a2" hy ah 
= Oh: B)— 3 Slee) 
o qr yar Bit — git TS egg? ag gett 
= —S B 
2 jap — i ies = BR, 
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(6) Let k > 1 be odd. From 
2a=A+VD, 
28=A-VD, 
by raising to the power k, and subtracting, 
rit =2] 2" (gh, eon) 
“+ \2h+1 
Dividing by 2(a — 3) = 2VD, we have 
k-1 ces ki k—2h—1 ph 
gta Oe eS Ga. De 


Similarly, by raising the expressions of 2a and 2G to the power k, 
by subtracting and dividing by 2, we obtain 


(K-/2 / 
2 (ar + 3*) _ S- (°,) Ak-2h pe. 


h=0 
(7) By Chapter VII, (1D), we have the identity 


(k-1)/2 
»k(k-h-1 
k k 

Y) 
ROVE SREY So (Ss vre( te 


h=l 
We take X =a", Y = —@", and since k is odd, 
i Eee k= 


xy x +Y)R-?", 


ark grh—(ar—pryhe Ye S(T Joana oe 


Dividing by a — 3, noting that aG = B and a — 3 = VD then by 
(1) we have 


(k—1)/2 


Rap = D&- D/2 RR + », 


k(k-h-1 Brh p& 2h 1)/2 Rk 2h 
h\ n-1 4 


Now we investigate the divisibility properties of the terms of the 
recurring sequence (Ry) k>0- 

If m > 1 and if there exists an index k such that m divides Rx, 
we denote by r(m) the smallest such index. It is called the rank of 
appearance of m in the sequence (Rx)x>0- 
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LEMMA 2.2. (1) If m > 1 and gcd(m, B) = 1 then there exists 

the rank of appearance of m. 

(2) If m>1 and gcd(m, B) = 1 then m divides R, if and only 
if r(m) divides k. 

(3) If m,n > 1, ged(m, B) = ged(n, B) = ged(m,n) = 1 then 
r(mn) = Iem{r(m),r(n)}. 

(4) Ifp is an odd prime, p / B, then R, = (D/r) (mod p) (where 
(D/p) denotes the Jacobi symbol). 

(5) If p is an odd prime, p | B, then r(p) divides p — (D/p). 

(6) If p is an odd prime, p |) B, if k =v,(Rrw) ande>k, then 
r(p°) = p- *r(p). 

(7) Ifk>1,h>1 then Reng = Re, (mod R?). 

(8) Ifk>1,h>1 then Ryn =kR,RiZ; (mod R2). 


PrROoF. (1) Consider the set of pairs {(R, mod m, R,_; mod m | 
k =1, 2, ...}. Since there exist only finitely many couples of residue 
classes modulo m, there exist integers k,l, with k < 1 such that 


R, = R, (mod m), 
Ry, = Ri-1 (mod m). 


Since BR,» = AR,_, — R; then BRy_» = BR,» (mod m) hence 
Ry-2 = Ri-2 (mod m), because gcd(m,B) = 1. Repeating this 
argument, it follows that 0 = Ry = R)_, (mod m), so there exists 
an integer 1—k > 1 such that m | R,_,. Hence there exists a smallest 
integer r(m) such that m | Rec). 

(2) First we note that for every k > 2, if m divides R, then 
gcd(m, R,-1) = 1. Otherwise there exists a prime p dividing m, R; 
and R,_;. From R;, = AR, — BRyz_2 and gcd(m, B) = 1, it follows 
that p | Ry». Repeating the argument, we would conclude that 
p | R, = 1, which is a contradiction. 

Now let S be the set of all indices k such that m | R,; by (1), 
S #0. We show that ifk,h € S thenk+h eS. Indeed, by Lemma 
2.1(2), m| k+h. 

Similarly, if k,h € Sk < h, then h—k € S. In fact, Rp = 
Ry+(h—k) = Ry Rares — BRy_-1Rn_r, hence m | BR,_1Rp_x- But 
gcd(m, B) = 1 and, as shown above, gcd(m, Rz_1) = 1, hence m | 
Rh_k- 

This suffices to show that S is the set of multiples of its smallest 
element, namely r(m). 
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(3) We have m | mn and mn | Rymn) 80m | Rromny hence 
r(m) | r(mn); similarly r(n) | r(mn) hence | = Iem{r(m),r(n)} 
divides r(mn). 

Conversely, m | Ream) and r(m) | l hence m | R;; similarly n | R:, 
hence from gcd(m,n) = 1 then mn | R), that is, r(mn) | l. 

(4) By Chapter I, (3.3), we have 


Ry = = (a — 6)?" + pf = DP?” + pf. 


Since D®-))/? = (D/p) (mod p) then R, = (2) (mod p). 

(5) Ifp|D then (D/p) =0 and R, =0 (mod p). Let (D/p) = 1; 
we show that R,-; = 0 (mod p) hence by (2), r(p) | p— 1. Indeed, 
by Lemma 2.1(5), and by (4) above, 


2BR,y-1 = AR, — (a? + 6?) = A—(a? +a”) (mod p). 


By Lemma 2.1(6), 2?~'(a? + 3?) = A? (mod p). Hence 


2BR,-1 = 2?BR,1 = 2”? *A-— A? =A-—A?=0 (mod p). 
Since p / B, p # 2, then R,-; =0 (mod p). 
Now let (D/p) = —1; we show that R,,; = 0 (mod p), hence by 
(2), r(p) |p +1. Indeed, by Lemma 2.1(5), and by (4) above, 
2Ro41 = AR, + (a? + 6?) = -—A+(a?+ 6") (mod p). 
By Lemma 2.1(6), 2?~'(a? + G?) = A? (mod p), hence 


2Rys1 = 2? Ry. = —2? 1A+ AP =—-A+ A? =0 (mod p). 


Since p # 2 then R,4; =0 (mod p). 
(6) Let Ri») = p*t, with p /t, k > 1. By Lemma 2.1(7), we have 
for m> 1, 


Re(pypm = DO VPORE 


(p™—1)/2 

| MBG iho (some | r(p)h Fy (p™—2h—1)/2 pp” —2h 

a a =( h—1 )p ee Rip) 
h=1 


For h = (p™ — 1)/2 the summand is equal to p”™B")@"-D/? R,/,) 
and its p-adic value is m +k, since p / B. 
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IfO<h< (p™ —1)/2 then 


p™ (p™ —h—-1) prwyn pip™—2h-1)/2 po™—2h 
ol (P 88) ros norm 


p™ pp™— m 
> oy (BRE) =m + (om — 2A) ay(h) 
However, ifm > 1 and0<h < (p™—1)/2 then p™ > 2h+v,(h) +1. 
Indeed, 2h = p” —p*r with p /r, 0< s < mand if s=0thenr > 1; 
so 2h+v,(h) +1=p™"-—p*r+s+l<p™ because s+1 < p*r (since 
p # 2). Hence, 


m+ (p™ — 2h)k — v,(h) > m+v,(h)\(k-l+k>mt+k. 
We have also 
Dee) > p"k > m-+k, 


when m > 1 (because p # 2). Thus, for m > 1 we have vp (Ri(pypm) = 
m+k. This is also true when m = 0. 

Taking m = e—k > 0 then vy (Ryp(pype-e) = € SO p* | Ry(pype-e and 
therefore, by (2), r(p*) | r(p)p®~*. Since p* | Ry pe) then r(p) | r(p*). 
Hence r(p°) = r(p)p” with 0 < m < e—k. If m < e—k then 
Up(Ri(pypm) = M+k <e, so p® / Rywp)pm hence r(p°) 4 r(p)p™ and 
this shows that r(p°) = r(p)p**. 

(7) The proof is by induction, being trivial when k = 1. We have 
by Lemma 2.1(2): 


Resaynsa = Reensiy+r 
= Renzi Ray — BRenRn 
= RengiRnsi 
= REI (mod R?) 


by induction and since R,, divides Ry», (by Lemma 2.1(4)). 
(8) The proof is by induction, being trivial when k = 1. We 
have, by Part (7), by Lemma 2.1(2) and (4), and by induction: 
Retin = Renta = RaRenti — BRr-1 Ren 
= Fh a, = BR AAkR, RT 
= RAR Rng — BeRn-i] 
= R,RRi(kK+1)Ragi = (kK +1)RaRi,, (mod Rj), 
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noting that 


1843 


1846 


1878 


1920 


1930 


1995 


Raa = R,Ro = BRh_1 => —BRp_1 (mod Ry). 
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VI 


Arithmetic Restrictions on 
Hypothetical Solutions 
and on the Exponent 


Let p be an odd prime and assume that x,y, z are nonzero pairwise 
relatively prime integers such that «? + y? + z? = 0. 

In this chapter we indicate congruences and divisibility properties 
satisfied by expressions involving the numbers z,y,z,p. In some 
instances, we will be able to reach a contradiction, proving that 
Fermat’s last theorem (or the first case) holds for certain exponents 
p. In Section 3 we focus on a conjecture of Abel, which has not yet 
been completely established by a direct proof. 


VI.1. Congruences 


Let p be an odd prime and assume that 2, y, z are nonzero relatively 
prime integers such that x? + y? + z? = 0. For easy reference, we 
recall results from Chapter ITI, §1. 

If p / xyz then there exist nonzero integers r,s,t,71,$1,t1 such 
that 


rt+y=?, (a? + y?)/(a+y) =, z= tt, 
(LD pear, CPP Ge2) =i a= rry, 


z+n2=s?, (2? +a?) /(z+z2)=s8s), y= -S851, 
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p /rstr,s,t, and r, s,t,71, $1, ¢, are pairwise relatively prime. More- 
over r? + s? + 1? £0. 

If p /xy but p | z, then there exist nonzero integers r, 5,t,71, $1, t1, 
and n > 2 such that 


(1.2) 
ety=prr?, (a ty \/(et+y) =p, 2=—prth, 
ytz=r?, (y?+ 2?)/(a+ty)=r), L=-TT, 
t+y=s?, (2? +a?)/(e+y)=s, y= ss, 


pj rstr,s,t, and r,s,t,1r1,8,t; are pairwise relatively prime. We 
have r? + s? + pPr-ltP £ 0. 

Moreover: If p / xyz then r,; = 1 (mod 2p”), s; = 1 (mod 2p’), 
and t; = 1 (mod 2p’). If p | z, p xy then r; = 1 (mod 2p), s; =1 
(mod 2p), and t; = 1 (mod 2p’). If p /xyz then 


CSS ak, 
(1.3) y=—s? +k, 
z=-—t+k, 


where k = (r? + s? + t?)/2. If p| z, p xy then 


gS—r? +k, 
(1.4) y= —s?+k, 
z= —pPr—le +k, 


where k = (r? + s? + pert?) /2. 

We begin with an easy congruence, soon to be reinforced. From 
x? =x (mod p), y? = y (mod p), z? = z (mod p) then —z = —z? = 
x? + yP =ax+y (mod p), so 2? + y? = —2z? = (a+ y)? (mod p’). 

The first result is due to Fleck (1909). It was given by Lind in a 
weaker form, in 1910. It was rediscovered by Frobenius in 1914 and 
again by Vandiver (1914), Pomey (1923) and Pérez-Cacho (1958). 


(1A) Let p be an odd prime, and let x,y,z be nonzero relatively 
prime integers, such that x? + y? + z? = 0. 

(1) If p does not divide x then x?~' = 1 (mod p’*). 

(2) Ifp /xyz then (x+y)? = 2? + y? (mod p*). 
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PrROoF. (1) Ist Case: p does not divide yz. As recalled above, 
r, = 1 (mod p”). Hence « = —rr, = —r (mod p”) and x? = —r? 
(mod p?). 

By symmetry, we have also y? = —s? (mod p?) and z? = —t? 
(mod p*). Since x? + y? + 2? = 0 then r? + 5? +t? =0 (mod p’®). It 
follows from the above relations (1.3) that « = —r? (mod p*). Hence 
x? =x (mod p®) and x?~' = 1 (mod p?). 


2nd Case: Assume that p divides z, hence p does not divide y. 
As recalled above, t; = 1 (mod p”), hence t? = 1 (mod p®). 

On the other hand, pn — 1 > 4, so x = —y (mod p*) and pt? = 
(x? +y?) /(at+y) = ah —aP yt. --— ay? 7 +P"! = pxP-* (mod p*) 
hence 2?~' = t? =1 (mod p’®). 

(2) We have 0 = 2? + y? +2? = x2+y+4+z (mod p®), hence 
xr+y =-—z (mod p®). Then (x+y)? = —z? = x? + y? (mod p*). 


With methods from Class Field Theory, as a consequence of the- 
orems of Furtwangler, Vandiver proved (1914, 1919): 


(1B) With above notations: 2? = x (mod p®), y? = y (mod p’), 
2? =z (mod p®), andzr+y+z=0 (mod p*). 


The result which follows is again due to Fleck (1909). Partial as- 
sertions were rediscovered by Pomey (1923), Vandiver (1925), James 
(1934), Niewiadomski (1938), and Inkeri (1946). 


(1C) Let x,y,z be nonzero relatively prime integers such that 
xP ty? + 2? = 0. 
(1) Ifp/ayz thenx+y+z is a multiple of 6 and of rstp? and 
r+s+t is a multiple of p?. 
(2) Ifp|z thenx+y+z is a multiple of 6 and of rstp? and r-+s 
is a multiple of p, while r+s+t is not a multiple of p. 


PROOF. It follows from the relations (1.1) (respectively, (1.2)), that 
in both cases r,s,t divide x + y+ z. Also, x + y+ z is even and 
x? =x (mod 3), y? = y (mod 3), z? = z (mod 3), hence r+-y+2z= 
x? + y? + z? = 0 (mod 3). 

In the first case, it follows from (1A) that x? = x (mod p*), y? = y 
(mod p*), 2? = z (mod p®), hence x + y+ z = 0 (mod p?). Since 
p\ rst then rstp*® divides x + y+ z. 
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As it was recalled above, r; = 1 (mod p”), s; = 1 (mod p”), t; = 
1 (mod p?). It follows from relations (1.1) that « = —r (mod p’), 
y = —s (mod p*), z = —t (mod p?), hence r+ s+t=0 (mod p’). 

Assuming that p | z, then p / xy, hence it follows from relations 
(1.2) that p” divides z and x + y (with n > 2), hence p* divides 
x+y+z; since p /rst then p*rst divides x + y+ z. 

As recalled above, r; = 1 (mod p), s; = 1 (mod p) so by relations 
(1.2), c =—r (mod p), y = —s (mod p). Hence r+s=—(r#+y)= 
z=0 (mod p). Thus r+s+t=t#0 (mod p). 


The next proposition was proved, in the first case, by Spunar 
(1929) and James (1934); a simpler proof was given by Segal (1938). 


(1D) Let x,y,z be nonzero relatively prime integers such that 
xP + yP + 2? = 0. 
(1) Ifp xyz thenr+s+t#0. 
(2) Ifp|z thenr+s+p"t £0 and alsor+s+t#0 (where n 
was defined in (1.2)). 


Proor. (1) Assume that p /xyz and r+s+t=0. Since ged(r, s, t) 
= 1 we may suppose, for example, that r,s are odd while ¢ is even. 
Then r? + 5? = (y+z)+(z+2) =2+y+2z = @? — 2tt, (by relations 
(1.1)). Hence 
os 2t, — ot. 
r+s t 

The left-hand side is equal to r?~'— r?~?5+.---—rs?~? +4 5?-!, hence 
it is the sum of p odd numbers, so it is odd. On the other hand, 
2t, — t?-' is even, which is impossible. Hence r+s+t 4 0 in the 
first case. 

(2) Ifp|zthen p/zy and by relations (1.2), r? +s? =(yt+z)+ 
(zg+a2)=a+y4 22 = prt? — 2p"tt,. Hence, ifr+s+p"t = 0 
then 


ils kt a Oty — pPror lye}, 

r+s prt 
Since p 't, and n > 2 then p does not divide (r? + s”)/(r+s). On 
the other hand, by Chapter IT, (3R)(4), gcd(p, r+s) = ged(r+s, (r?+ 
s?)/(r+s)). By (1C), p | r+s, hence p divides (r?+s”)/(r+s), which 
is a contradiction. From (1C), since p /r+s-+t, then r+s+t 4 0. 
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In connection with this result, Raclig considered in 1944 the fol- 
lowing conjecture, where p is an odd prime: 


(R’,) If a,b,c are nonzero integers and pabc divides a? + b? + cP 
thena+b+c=0 ora?+b?4+c?=0. 


Raclis showed: 


(1E) = Let p be an odd prime. If (Rj) is true then the first case of 
Fermat’s theorem is true for the exponent p. 


PRooF. Assume that x,y,z are nonzero pairwise relatively prime 
integers, not multiples of p, such that x? + y? + z? = 0. Let r,s,t 
be defined as in relations (1.1), so r,s,¢ are nonzero integers. We 
remarked after (1.1) that r? + s? + t? £0, and by (1D), r+s+t# 
0. By (1C), p divides r+ s+t. By (1.1) and (1.3), r,s,t divide 
r? + 5P+¢?, and since p,r, s,t are pairwise relatively prime then prst 
divides r? + s? + ¢?, showing that (Rj) is not true. 


The validity of (Rj) is very questionable. (Ry) and (Rs) are false: 
p=3,a=b=c=1 and p=5, a = 33, b = —2, c = —1 provide 
counterexamples. 

Similarly, consider the following statement: 


(R}) Ifa,b,c are nonzero integers and pabc divides aP + bP + pPr—lcop 
(for some n > 2) thena+b+ p"c =0 or a? + bP + prr-'ce? = 0. 


Then: 


(1F) Let p be an odd prime. If (R}) is true then the second case 
of Fermat’s theorem is true for the exponent p. 


ProoF. Assume that x,y,z are pairwise relatively prime integers, 
such that p | z and x? + y?+z? = 0. Let r,s,t, and n be defined as in 
relations (1.2), so r,s,t are nonzero integers, n > 2. Then by (1D), 
r+s+p"t #0, and by the remark after (1.2), r? + 5? + p?”—!t? £0. 

However, as already shown in (1C), r+s =0 (mod p), so p divides 
rP + sP + pPr—ltP_. By (1.4), r,s,t divide r? + s? + p?”—!t? and since 
p,r,s,t are pairwise relatively prime then prst divides r? + s? + 
pet. This contradicts the assumption (R’). 
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In 1946, Inkeri generalized (1A). He also proved the corresponding 
result for the second case, with a more powerful method from Class 
Field Theory. 


(1G) [fp is an odd prime, n> 1, if x,y,z are nonzero relatively 
prime integers such that x?” +y?" + 2" =0, and if p does not divide 
x then 2-1 =1 (mod p?"t"), 


PROOF. Since x,y, z are pairwise relatively prime, we may assume, 
for example, that p /y.We have 


—a?" = y? + 2?" = (y+2) II Q» ees 2") ; 


m=1 


By Chapter II, (3C)(2), the factors on the right are pairwise rela- 
tively prime, so y+ z= a?" where a| x, p //a. Similarly, x + z = b?” 
where b| y, p /b. 

If p / z, we have similarly x + y = c?” where c| z, p /c¢ and (a? + 
y’)/(a + y) = d?” where d | z,p /dand gcd(c,d) = 1. However, if 
p | 2, by Chapter II, (3C)(3), x+y = pc?”, (a? + y?)/(e@+y) = pd?” 
where c| z,d|z,p/c, pd, ged(c,d) =1,h>1. 

We show that if g is any prime dividing d (whether p | z or p /z) 
then g = 1 (mod p”*'). Indeed, since q| d then q | #?+y?,q|z,q4 
p, and qc, hence q /x + y. So 


y =a" (mod q), 
x = b?" (mod q), 


hence q Ja?" +b?". But a?"™' +b?" = x? + y? =0 (mod q). There- 
fore q is a primitive factor of the binomial a?""’ + b?"”'. By Chapter 
II, (3G), g=1 (mod p”*"). It follows that d= 1 (mod p"*'), hence 
d?” =1 (mod p?"*1). 

To conclude the proof, we examine separately the two cases. 

If p x then (x? +y?)/(a+y) =1 (mod p?”"*") hence x? +y? = x+y 
(mod p?”*"). By symmetry, y? + 2? = y+ z (mod p?”*") and x? + 
2? =x+2z (mod p*”"*'). Adding up these congruences, dividing by 
2, and subtracting the second one, we obtain x? = x (mod p?"*'), 
hence z?~' = 1 (mod p?"*?). 

If p | z; let u(z) = k > 1, so uv (2"") = p"k > n+1. By 
Chapter II, (3C)(4), v,(z@ + y) = p"k — n, so x = —y (mod p*?"—-"), 
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Bg gP P 


= pz’ (mod greys 


But kp” —n > 2n+ 2, because p > 3 (otherwise the hypothesis is 
not satisfied); hence x?~1 = d?” =1 (mod p?"*?). 


The next group of results will involve congruences of the type 
(142) =1+2”" (mod p**t?). 


It is convenient to precede the discussion with the following easy 
result, which is given explicitly by Ferentinou-Nicolacopoulou (1965): 


LEMMA 1.1. Let p be an odd prime not dividing the integer a. 
(1) Ifn>m2>0 then a? =a?” ™ (mod p?-™*?), 
(2) Ifk>1 then a? — a? = ap**(a?-! — 1) (mod p*t?), 
If p also does not divide a+ 1: 
(3) Ifk >1 then [(a+1)” — a? —1)-[(a+1)"* — a” 
= p*-![(a +1)? — a? — 1] (mod p**"). 
(4) Ifk > 2 then (a+1)” =a?" +1 (mod p*t!) if and only if 
(a +1)" =a?" +1 (mod p**). 
(5) Ifk > 1 and (a+1)?" =a” +1 (mod p**?) then (a+1)" | = 


k-1 


a? +1 (mod p**"), 


-1 


— 1] 


Proor. (1) a?” - = a? = a (mod p), hence raising to the 


power p”—™: 


a?’ =a?” (mod p™™*?), 
(2) a? — a? =a? "fa @-) — 1). If a?! = 1+ bp then 
(aP-1)P = 1+ bp" (mod pet), 


hence 


(ar)? 1=bp*' = (ar 1) p’-* (mod p**), 


Since a?” ' =a (mod p) then 


k k-1 


a? —aP =a(a?-1—-1)p*" (mod p**"). 


172 ~=VI. Arithmetic Restrictions 


(3) By (2) and the hypothesis, p does not divide a+ 1: 
(a+ 1)" = (a+ I" = (at 1 ((at 1) = 1) (mod gp) 


and similarly 


Therefore, by subtraction, 


k k-1 k-1 


(ear a aF Si = eae Sr aa 


(4) First note that if k > 2 then a?° =a?" ' =a? (mod p?) and 
also 
(a +1)" =(a+1)"" =(a+1)? (mod p?). 


Thus 
Qe a9" Sta Gai Se? S41 
= (a+1)?—a’—1 (mod p’) 
If 
(a+ 1)? —a? —1=0 (mod p**") 
then also 


The converse is proved in the same way. 
(5) By hypothesis, (a + 1)? = a? +1 (mod p*t") hence also 
(a+ 1)" =a? +1 (mod p*™). 


In particular, as was noted by Birkhoff and was published by 
Carmichael in his second note of 1913, if p does not divide a, nor 
a+1, then (1+a)? = 1+a” (mod p*) if and only if (1+a)” =1+a”" 
(mod p?). 

The following result was given by Klosgen in 1970: 


(1H) Let p be an odd prime, andm > 1. The following conditions 
are equivalent: 
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(1) There exist integers x,y,z, not multiples of p, satisfying the 
congruence 


oP” + yP™ 4 2?" =0 (mod p™"?). 
(2) There exists an integer a, 1<a< (p—3)/2, such that 
1+a?" =(1+a)?” (mod p™*’). 


Moreover, if any two of the numbers x,y,z are congruent modulo p 
then 2?"-1 =1 (mod p™*?), 


PROOF. We need only to show that (1) implies (2), since the other 
implication is trivial. Let z’ be an integer such that z’z = 1 (mod p). 
Let a = z'x (mod p), —b = 2’y (mod p), where 1 <a<p-—1,1< 
b<p-—1. Then 


a?” = 2/?" 2?" (mod p™*), pp” = 2?” (mod p™*?), 


so aP” +1= 0?" (mod p™*"). If b=a+t (mod p) then a?” +1 
bP” = (a+t)” =a+t (mod p) sot = 1 (mod p), and a?” +1 
(a+1)”" (mod p™*1), with1<a<a+1<p-1l. 

If a= (p—1)/2 then a+ 1 = (p+ 1)/2 = —(p—1)/2 (mod p) so 


(p— 1)?" +2?" =—(p—1)?" (mod p™*") 


hence 2?” = 2 (mod p™*') and we take a = 1, since 1?" +1=(1+ 
1)?" (mod p™*?). If (p—1)/2 <a < p—2 then taking a, = p—1—a 
we have 1 < a, < (p— 3)/2 and 


1+a?” =(1+a,)" (mod p™*). 


For the last assertion, we may assume, for example, that x = y 
(mod p), the other cases being similar. In the course of the proof, we 
had a = 2’x = z'y = —b (mod p), b=a+1 (mod p) hence 2a = —1 
(mod p), so a = (p—1)/2. Therefore 2?"-! = 1 (mod p™*?), 


An immediate corollary of this result is the following. 

If the first case of Fermat’s theorem fails for the exponent p, 
that is, if there exist integers x,y,z, not multiples of p, such that 
x? + y? + z? = 0, then we also have x? + y? + 2? = 0 (mod pp”). 
There exists a, 1 < a < (p—3)/2, such that 1+ a? = (1+)? 
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(mod p?). Carmichael (1913) and Meissner (1914) obtained the fol- 
lowing more precise results (the statement for the first case is also 
given by Gandhi, in 1975). 


(11) Let x,y,z be nonzero relatively prime integers such that x? + 
y+ 2=0. 

(1) If p / xyz there exists an integer a, 1 < a < (p—3)/2 such 
that 


(1+a)?=1+a” (mod p*), 


or equivalently, (1+ a)? =1+ a? (mod p’). 
(2) If p| xyz there exists an integer a, 1 <a < (p—3)/2, such 
that (1+a)? =1+ a? (mod p’). 


Proor. (1) By (1A), 2? =a (mod p’) so 2?” = a? = x (mod p’). 
Similarly y?” = y (mod p*) and 2” = z (mod p*). Hence, by (1C), 
xP + yP +2? =xt+y+z=0 (mod p?*). 

By (1H) there exists a, 1 < a < (p—3)/2, such that (1+ a)” = 
1+ a” (mod p?) and by Birkhoff’s remark following Lemma 1.1, 
(1+ a)? =1+ a? (mod p’). 

(2) If p | xyz, we may assume, for example, that p / xy, p | 
z. According to (1C), x +y+2z=0 (mod p?). Then (x+y)? = 
—zP = xP + y (mod p?). Since x # 0 (mod p), y # 0 (mod p), 
there exists an integer b, 1 < b < p—1 such that y = bx (mod p). 
Then y? = b?x? (mod p”) and (x+y)? = x?(1+ 6)? (mod p*). Thus 
x?(1 +b)? = x?(1 +6") (mod p’). Since p x then (1 + b)? = 1+ b? 
(mod p’). 

As in (1H), if 1 < b < (p—3)/2 we take a = 6, if (p—1)/2 < 
b < p—1 we take a = p—1-— band if b = (p—1)/2 then we take 
a=l. 


——_—— 


We remark that the above criterion for the first case is useless 
when p = 1 (mod 6), as pointed out by Birkhoff (in Carmichael’s 
second note, 1913). For this purpose, we establish the following easy 
lemma: 


LEMMA 1.2. Let p be an odd prime. Then p = 1 (mod 6) if and 
only if p € 3 and there exists an integer t, 1 <<t <p—1, such that 
t?+t+1=0 (mod p). 
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ProoF. Let p = 6k +1, let g be a primitive root modulo p. Let t 
be such that 1 < t <p—1 and t= g?* (mod p). Sot #1 (mod p), 
but t? = 1 (mod p). Hence t?+¢+1 = (t?—1)/(t—1) =0 (mod p). 

Conversely, if t?+t+1 = 0 (mod p), then (2t+1)? = 4t?+4t+1= 
A(t?+t+1)—3 = —3 (mod p). So —3 is asquare modulo p # 3. Hence 
1 = (—3/p) = (p/3) so p=1 (mod 3), hence p= 1 (mod 6). 


Now we use the following fact established by Cauchy in 1841: if 
p = 1 (mod 6) then the polynomial pX (X +1)(X?+X +1)? divides 
(X +1)? — X” — 1 (see Chapter VII, (2A)). 

Thus, if p= 1 (mod 6), by the lemma there exists t, 1 <t < p—1, 
such that t? +¢+1= 0 (mod p). It follows that (1+t)? =1+? 
(mod p*) and by a remark of Birkhoff, (1+ ¢)" =1+#? (mod p?). 
Proceeding as in the proof of (1H), there exists a, 1 < a < (p—3)/2, 
such that (1 +a)” =1+a® (mod p?). 

This establishes the assertion that if p= 1 (mod 6) then the cri- 
terion of (11) is useless in the first case. 

Wagstaff verified in 1975 that for every prime p < 100000, p= —1 
mod 6, the congruence (1 + x)? = 1+ 2? (mod p®) has no solution 
in integers a,1 <a < (p—3)/2. In this way, it was proved that the 
first case is true for such exponents. 

In the same year of 1975, Gandhi had independently suggested 
that such computations be performed. (11) may be rephrased as 
follows: 


(1J) Let g be a primitive root modulo p. 
(1) If1+g?”’+g*” £0 (mod p®) for all indices j,k =1,... ,p—1 
then the first case of Fermat’s theorem holds for the exponent 
D. 
(2) If1+g3?+g*? £0 (mod p?) for all indices j,k =1,...,p—1 
then Fermat’s theorem holds for the exponent p. 


ProorF. (1) If the first case fails for the exponent p, there exists 
a,1 <a < (p—3)/2, such that (1 +a)” = 1+ a” (mod p’). 
Note that a 4 0, —1 (mod p). Let j,k be indices such that a = g’ 
(mod p), —(l +a) = g* (mod p). Then a” = g’”” (mod p*), (1+ 
a)” = —g*” (mod p’) and therefore 1+ g’”” + g*”” =0 (mod p’). 
(2) If Fermat’s theorem fails for the exponent p, there exists a, 1 < 
a < (p—3)/2 such that (1+ a)? = 1+ a? (mod p”). We conclude 
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similarly that there exist indices j,k such that 1 + g/? + g*? = 0 
(mod p?). 


In 1950, Trypanis announced without proof the following strength- 
ening of Carmichael’s result. It was rediscovered by Ferentinou- 
Nicolacopoulou in 1965 (we present her proof) and generalized by 
Klésgen in 1970 (see (1J)). 


(1K) _  [f the first case of Fermat’s theorem fails for the exponent 
p> 5, there exists an integer a, 1<a< (p—5)/2, such that 


(1+a)” =1+a? (mod p'), 
or equivalently 


(1+a)” =1+a” (mod p’). 


PRooF. Assume that x, y, z are positive integers, not multiples of p, 
such that 


xP + y? = 2. 
Let t be the order of p modulo z, so t is the smallest positive integer 
such that 

p' =1 (mod 2). 
We write p' — 1 = dz, for some integer d > 0. Clearly p does not 
divide d. Let m = dx,n = dy, hence p does not divide m,n and 


m? +n? = (p' —1)?. 


So m,n, —(p'—1) satisfy Fermat’s equation. By (1C), m+n = p'-1 
(mod p?). 

Let us note that m,n are less than p’ — 1 and since (m+n)? > 
m? +n? then m+n> p'—1,so0 p'<m+n+1< 2p‘. This implies 
that t > 4, because if t < 3 then 


m+n=p'—1 (mod p’), 


that is, m+n+1=0 (mod p‘), which is not possible. 
Since m+n = —1 (mod p*) then (n+ 1)? = —m? (mod p*). Also 
from m? + n? = (p' — 1)? we deduce that 


n?+1=—m? (mod p'*), 
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therefore n? + 1 = —m? (mod p’), and combining with the previous 
congruence, 

(n+1)?=n?+1 (mod p*). 
By (1A), m?-1} = n?-1 = (p'—1)?-1 =1 (mod p’) and fromn+1= 
—m (mod p*) we have 


(n+1)?t=m?*=n?"'=1 (mod p’). 


As is known, there are y(p*) = p?(p— 1) invertible residue classes 
modulo p®, and they form a multiplicative cyclic group. If the residue 
class of w modulo p? is any generator of this group, from 


n®-'=1 (mod p*) and = (n+1)?'=1 (mod p?) 
it follows that the orders of the residue classes n mod p® and (n+1) 
mod p® divide p — 1; so there exist positive integers h,k, 0 <h, k < 


p — 2, such that , 
n=w'? (mod p*) 
and ’ 
n+1=w*? (mod p*). 
Let b= w" (mod p). Then 


n=? =b (mod p) 


and 


k 


n+1=w'” =w* (mod p), 


so we =b+1 (mod p) and we deduce that 
n? = 0? (mod p*) 
and ; 
(n+1)?=(b+1)” (mod p’). 
Since (n + 1)? = n? +1 (mod p%) it follows that 
(b+ 1)" =b” +1. (mod p’). 


Let us note that p does not divide 6 nor 6+ 1. If 1 < 6 < (p—5)/2 
we take a = b. If b = (p —3)/2, from 


CRY ERY +1 ona 


we deduce that 
== 23?" 4 9°" (med p’), 
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hence we may take a = 2. If b = (p—1)/2 then 


3)" 41 cote 


2 2 


hence 
= 1 2" (med p*); 


so we take a = 1. If (p+1)/2 < b < p—1, leta = p—1-—Db, so 
1<a<(p—3)/2 and 


GHD? Sar S164) Stee? sonedp 


By Lemma 1.1(4), it follows that 


(a+1)”" =a” +1 (mod p’). 


Let us note that according to Lemma 1.1(5), (1K) is in fact a 
strengthening of (11). 

From these results, we obtain as an immediate corollary the one 
indicated by Gandhi (1976): 


(1L) [If the first case of Fermat’s theorem fails for the exponent p 
then there exists an integer b, not a multiple of p, such that (1+6)? = 
1+ 0? (mod p’*). 


PROOF. By (1J) there exists a, not a multiple of p, such that 
(1+a)” =1+a” (mod p’). 
By Lemma 1.1, 
(1+a)?=1+a? (mod p*). 
Raising to the pth power: 
(1+a)" =(1+a?)’ (mod p'), 


so 
(1+a?)?=1+a” (mod p’). 
Letting b = a? then (1 +b)? =1+ 0? (mod p%*). 
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Bréié-Kostié proved in 1952 a result of the same kind, under some 
special conditions on the prime exponent p. Klosgen proved in 1970 
the following generalization of (1K): 


(1M) Ifn > 1, if x,y,z are integers, not multiples of the odd 
prime p, such that x?” + y®" + 2?" =0, then 


oP” 4 yP™ 4 2" = 0 (mod p*"*4), 
or equivalently, there exists an integer a, 1 < a < (p—3)/2, such 


that 
(1+a)"" =1+a”" (mod p***), 


PRroorF. By (1G), 2? = x (mod p?"*'). Raising to the power p” : 

aP?"** = x?" (mod p?"*1), Again, a?" = 2?" = a?” (mod p31) 
< % 3n n < 2 

and repeating this procedure, 2? = x? (mod p®”*'!). Similarly, 


y?” = y?" (mod p?”*") and 2?” = 2" (mod p?"*!). Hence 


oP" yh 4 oP” = oP” + yh” + 2?" = 0 (mod p*"*1), 


The last assertion was proved in (1H). 


Taking n = 1, we obtain (1J). 

Johnson investigated (in 1977) whether the congruences of Carmi- 
chael and Trypanis may be further strengthened, modulo every power 
p"*? (n > 1). This is a typical situation to be handled by p-adic 
methods. ) 

Let a; € Z, denote the unique p-adic integer which is a (p — 
1)th root of unity and is such that a; = 7 (mod p), for every 7 = 
1,2,... ,p—1 (see Chapter V, (1W)). 


(1N) Let p be a prime, p > 3, and let a be an integer such that 
p does not divide a nora+1. Then the following conditions are 
equivalent: 

(1) for everyn > 1: (1+a)?” =1+a”" (mod p"*?); 

(2) A+ A = A140; and 

(3) a2 +a+1=0 (mod p). 


PROOF. The p-adic development of a, is of the form a, = a+ paP, 
with pz € Z, and pz is uniquely defined by ag. We show that for 
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every n > 0 we have the congruence 
(1.5) O, =a?” + p.p"* (mod p**?). 


For n = 0 it is trivial. We assume that (1.5) is true for n. Then from 
a’-! = | it follows that 


Qq, = (a)? = (aP” + pap”*")? 


a + pap”t2aP?” 1) 


ae pap"? (mod p"**), 


since a?" -1) = 1 (mod p). 

Now we show the equivalence of the statements (1), (2), (3). 

(1) > (2) For every n > 1,1+ a" = (14 a)”” (mod p"*?), 
Hence by the above congruence (1.5): 


1 + AA = 1 + a? + Pap" = (1 + a)" + pap (mod per), 


On the other hand, a1,, = (1 +a)?" + pizap"*! (mod p"*?). Hence 
1+ Q¢ = 144 (mod p”*') for every n > 1. From the uniqueness of 
the p-adic development it follows that 1+ a, = Qi4¢. 

(2) + (1) We have 


Q+a= (1 + a)?" a pieip? (mod po) 
and 
L+ag=1+a?" 4+ pap"t! (mod p"*?), 


By hypothesis, aj4, = 1+a,, so (14+ a4) 4+ piyap = 14+ (a4 pap). 
By the uniqueness of the p-adic developments, $11, = Pa, hence 
(1 +a)?" =1+ a" (mod p"*?). 

(2) — (3) The p-adic (p — 1)th roots of unity constitute a mul- 
tiplicative cyclic group. Let a; be a generator. We consider the 
subfield Q(a;) of Q,. It is a Galois extension of Q. Let o be the 
automorphism such that o(a;) = a;'. Since a, = (a;)* (for some 
exponent k) then o(a,) = (a;)~* = az'. Similarly o(a144) = AT fa- 

From the hypothesis 1+ a, = Q14,, we deduce by applying o that 
1+az!=aj7{,- Hence 


AaQAi+ta + A1+a = Qa; 


SO 
A +027+1+04 =O 
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and 
a +a,+1=0. 
Since a, =a (mod p), it follows that a? +a+1=0 (mod p). 

(3) > (2) Since p 4 3 and a? +a+1=0 (mod p) then a #1 
(mod p). Multiplying with a — 1 we have a® = 1 (mod p). Then 
(aa)? = 1 (mod p). But (a2)?~! = 1, so a? = 1, which is the unique 
(p —1)th root of unity in Z, congruent to 1 modulo p. Since a, #1, 
it is a primitive cubic root of 1, and therefore 


a +a,+1=0. 


So ag +1 = —a? = —a? =a+1 (mod p), and therefore a, +1 = 
Qa41, the unique (p—1)th root of 1 in Z, which is congruent to a+ 1 
modulo p. 


Let us note that in view of Lemma 1.1, condition (1) above is 
equivalent to: 


(1!) For everyn>1:(14+a)?"" =1+a”"™ (mod p"*?), 


According to Lemma 1.2, if p > 3 there exists an integer a, 1 < 
a < p—1, satisfying the equivalent conditions of (1N) if and only if 
p=1 (mod 6). 

As a corollary, we have: 


(10) Ifp is a prime, p=5 (mod 6), then there exists an integer 
No > 0 such that ifn > np anda =1,2,...,p—2 then 


(1+a)" #1+a?” (mod p”*?), 

Proor. By Lemma 1.2, if a = 1,2,...,p—2 then a2+a+1#0 
(mod p). By (1N), for every a there exists an index n(a) such that 
(14a) #140" (mod p™*), 

By Lemma 1.1(5), if n > no = max{n(a) | a=1,2,... ,p — 2} then 


(1+a)" 41+?" (mod p"*’) 


for every a=1,2,...,p—1. 
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It is interesting to find out whether the following implication is 
true: 


(I) Ifa is an integer, a = 1,2,...,p—2, and (1+ a)? =1+a? 
(mod p®) then a? +a+1=0 (mod p). 


Indeed, if (I) is true then the first case of Fermat’s theorem holds 
for every exponent p=5 (mod 6). Because, otherwise by (11) there 
exists a, 1 <a < p—2, such that (1+a)? =1+a? (mod p*). Hence 
by (I), a2 +a+1=0 (mod p) and therefore p = 1 (mod 6), contrary 
to the hypothesis. 

Arwin showed in 1920 that there exist integers a and primes p 
such that (1 + a)? = 1+ a? (mod p”) but a? +a+1#0 (mod p). 
So such a strengthening of the implication (1) is false. 
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V1.2. Divisibility Conditions 


Let p be an odd prime and assume that 2, y, z are nonzero relatively 
prime integers such that x? + y? + z? = 0. In this section we shall 
indicate some divisibility conditions which the integers x, y, z or some 
of their combinations must satisfy. 

The following proposition was given by Pérez-Cacho in 1958. How- 
ever, statement (2) had already been proved by Massoutié in 1931; 
a simpler proof was also given by Pomey in 1931. 


(2A) Let p be an odd prime number and assume that there exist 
nonzero pairwise relatively prime integers x,y,z such that x? + y? + 
zP=0. 

(1) If 3 does not divide xyz then x = y = z # 0 (mod 3), the 
integers x? — yz, y? — xz, z* — xy are divisible by 3, but not 
by 9, and if q is a prime, q 4 3, dividing one of the numbers 
x? — yz, y? — rz, 2 — ry, then q=1 (mod 6). 

(2) Ifp=-—1 (mod 6) then 3 divides xyz. 


Proor. (1) Since 3/ xyz then x,y,z are congruent to 1 or to 
—1 modulo 3. From (+1)? + (+1)? + (£1)? = 0 (mod 38) the only 
possibility is that 2 = y = z # 0 (mod 3). Therefore x? = yz 
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(mod 3), thus 3 divides x? — yz, and similarly 3 divides y? — xz, 2? — 
Ly. 

We note that one, and only one, of the integers x,y,z is even. 
Thus 2? 4.42, 9? Aw2;-2? Ff oy. 

We show that 9 /z?—axy. We have «7? + xP y? +a?2? = 0 and ry = 
(xy — 27) + 2? so —(a?P + xP? 2?) = xP y? = [(ry — 2?) + 27]? = v(zy — 
27)\z2(P-)) + 2°P (mod (z? — xy)?). Noting that x,y,z are distinct 
(since 2 is not a pth power), we have 


Q3(a?, 2?) = 2?P + oP 2? + 2?P = p(z2-—a2y)2z2?-) (mod (z? — zy)?). 


Since 3 | x? — z?, by Chapter II, (3B)(6), we have v3(Q3(z?, z?)) 
= v3(3) = 1. Thus 9 2? — xy. 

Now let q be a prime dividing z? — ry, gq 4 3 (the argument is 
similar if q divides x? — yz or y? — xz). Then q / z (otherwise q | 
z and q|ax or q| y, contrary to the hypothesis). Let z’ be such that 
zz! =1 (mod q). Multiplying with z’?? we have (x?z’?)? + (x?z'?) + 
1=0 (mod gq). By Lemma 1.2, g=1 (mod 6). 

(2) If3 /xyzand p Jz (the argument is similar when p )'x or p /'y) 


then 
2p __ »Pq)P 
2 a vy 
V5, TG. SS 
Q,( b y) 2 = ry 
2p P yP 2p 
x Ce zZ: Zz 
= 7 — ~ = p2*-) = p (mod 2? — zy). 


Also 22 — xy = 27? — xPy? (mod p). If p/ z? — xy then from the 
above congruence, p = 1 (mod z?— zy) so p=1 (mod 3), by (1). If 
p| 2°? — ay, by (1), p=1 (mod 3). 


Pomey claimed in 1931 to have shown that for any exponent p, 
if x,y,z are nonzero and x? + y? + z? = 0 then 3 divides xyz. He 
also claimed in 1934 that 5 divides xyz. However, his proofs were 
erroneous (see Brauer, 1934). 

Inkeri has proved in 1946 the following statement: 


(2B) [fp is an odd prime, p#1,9 (mod 20), if x,y, z are nonzero 
integers such that x? + y? + z? =0 then 5 divides xyz. 


PROOF. By hypothesis, p # 5 and we may assume that x,y,z are 
relatively prime. 
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If5 /xyz then z?, y?, z? are congruent modulo 5 to +1, +2. Chang- 
ing notation, if necessary, we may assume that x? = y? or — y? 
(mod 5). If we had x? = —y? (mod 5) then z? = —ax? — yP = 0 
(mod 5), contrary to the hypothesis. Therefore, x? = y? (mod 5). 

Let h be an integer such that ph = 1 (mod 4). If p= 3 (mod 4) 
we may take h = 3. Raising the above congruence to the power h, we 
deduce that x = y (mod 5). Since z? = —ax? — yP = —2y? (mod 5) 
then again 2°? = 4y?? = —x?y? = (—xy) (mod 5) hence raising to 
the power h, z? = —xy (mod 5). In particular, since x = y = +1 or 
+2 (mod 5) then z? = +1 (mod 5). 

We examine now the integer 27? + «?y?, which is necessarily odd, 
since exactly one of the integers x, y, z is even. 

We show that if q is a prime, g 4 5 and q divides z?? + x?y? then 
q = +1 (mod 5). Indeed, we have the relation (27? + 3y?)? — 5y?? = 
A(x?P + 3aPy? + y??) = A(z °P + gPyP). So for such a prime gq, we 
have the congruences (2x? + 3y?)? = 5y?? (mod q). So 5 is a square 
modulo q. By the quadratic reciprocity law, 


1 (2) = (2) thus g=+1 (mod 5). 


It follows that any factor k of z?? + xy? which is not a multiple of 
5 must be congruent to +1 (mod 5). In particular, we take 


ates ary? 2(p—1) 2(p—2) 


= = 2(p—3) Oe nade p-l 
k aay % Zz rytz (xy) + (xy). 
Since z? = —ay (mod 5) then k = pz?®-) = p (mod 5) because 
z? = +1 (mod 5). Thus k # 0 (mod 5) and therefore k = +1 


(mod 5). 

We conclude that p = +1 (mod 5). 

We have still to show that p = 1 (mod 4). If we assume that 
p = 3 (mod 4) then 3p = 1 (mod 4) hence z = z*? = (—2)%y*? = 2y 
(mod 5). Considering the relation (22? + 3y?)? — 5y?? = 4(2°? + 
Sy? z? + y?P) = A(x? + yPz?) we deduce as before that every prime 
factor gq #5 of x7? + y?z? must be congruent to +1 (mod 5). Hence 
every factor k of x?? + y?z?, k #0 (mod 5), must be congruent to 
+1 (mod 5). 

In particular, taking k = x? + yz, if p = 3 (mod 4) we have 
y = —2z (mod 5) hence x? = y? = —2yz (mod 5) thusk = 7?+yz = 
—yz £0 (mod 5). Therefore k = +1 (mod 5). However, on the 
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other hand k = x2? 4+ yz = y? + 2y? = 3y? = +2 (mod 5), which is a 
contradiction. This shows that p= 1 (mod 4). 

Therefore, p = 1 or 9 (mod 20), contrary to the hypothesis, show- 
ing that 5 | xyz. 


Pérez-Cacho proved in 1958: 


(2C) [fp is an odd prime number, if x,y,z are nonzero relatively 
prime integers such that x? + y? + z? = 0 then: 
(1) ifq¢ 45 is a prime factor of (x? — yz)(y? — zx)(z? — xy) then 
q = +1 (mod 10); and 
(2) none of the numbers x? — yz, y® — zx, 27 — ry is a multiple 
of 25. 


Inkeri also proved in 1946 the following result: 


(2D) Let p be an odd prime number, and assume that x,y,z are 
relatively prime nonzero integers such that x?” +y”" +2?" = 0 (where 
n>1). Then: 
(1) 5 divides xyz(x — y)(a — z)(y — 2). 
(2) 7 divides xyz(a — y)(x— z)(y — z)(a? — yz)(y? — xz) (2? — zy) 
(if p > 3). 
(3) 11 divides xyz(x — y) (x — z)(y —z) (a? +yz)(y? + 2z)(27 + ry) 
(if p > 5). 


PrROoF. First we note that if / is a prime and p // — 1, if a,b are 
nonzero relatively prime integers then / | a+ if and only if 1 | a?+b?. 
Indeed, if a or } is a multiple of J, it is obvious. 


Let | /ab. 

It is clear that if 1 | a+6 then /| a? + b?. Conversely, if 1 | a? + b? 
but | /a+b then / is a primitive factor of a? + b? and by Chapter II, 
(3G), /=1 (mod p), which is a contradiction. 

Let u = x2?", v = y”", w = 2”", so we have u+v+w =0 and we 
need to show (in view of the above remark, because p > 11): 

(1) 5 divides uvw(u — v)(u — w)(v — w). 
(2) 7 divides uuw(u—v)(u—w)(v—w) (uw? -—vw) (v?—uw)(w? —uv). 
(3) 11 divides 

uvw(u — v)(u — w)(v — w)(u? + vw) (v? + uw) (w? + uv). 
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Let 1 = 5,7, or 11. 
Since gcd(u,v,w) = 1, we may assume, for example, that | | v. 
Let v’ be such that v’v = 1 (mod 1). Hence multiplying with v’ we 


have t+t’ = 1 (mod J), where t = —v’u (mod J), t/ = —v’w (mod J). 
Let 

P=, 

ae 

Ts P40, 

Tori, 

Ts =t—2, 

TS Peat, 

Tt = +t-1, 

T, = ? —t—-1, 


Ty = ? —3t+1. 
Then 
vT, = —u (mod J), 
vTy = —u—v=vw (mod J), 
vT3 = —u+t+v (mod J), 
vl, = —2u—v = —u+w (mod J), 


uvTs = —u—2v=w-—v (mod J), 


vTs = w+ouuty? =u? — vw 

=v? — uw =w? — vu (mod J), 
VT, = Ww -—vu-—v? = wu? + vw (mod J), 
vTs = vw +vu-—v? = —-v? — uw (mod J), 


VTy = uw? + 30ut+ v0? = w? + uv (mod J). 


(1) Let 1 = 5, then it is easy to verify: if 


(mod 5) then 5] 7}, 
(mod 5) then 5] Ts, 
(mod 5) then 5] Ts, 
1 (mod 5) then 5 | 73, 
2 (mod 5) then 5| 7%, 
) 


a. a a 
HW ET 


hence 5 divides wuww(u — v)(u — w)(v — w). 
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(2) Let / = 7; then it is easy to verify: ift = —3, —2, -1, 0,1, 2,3 
(mod 7) then 7 divides T,, Ts, T3, T1, T2, Ts, Te, respectively. Hence 
7 divides wuw(u — v)(u — w)(v — w)(u? — vw) (v? — uw) (w? — vu). 

(3) Let 1 = 11; then if t = —5, —4, -3, —2, —1, 0, 1, 2, 3, 4,5 
(mod 11) then 11 divides T,, T7, Tg, Ty, T3, T,, T2, Ts, T7, Ts, To, re- 
spectively. So 11 divides uwvw(u — v)(u — w)(v — w)(u? + vw)(v? + 
uw)(w? + wv). 


Let us note that the argument breaks down for / > 11. Indeed, in 
this case if t = —2 (mod 1) then the values modulo | of T; (1 <i < 9) 
are distinct from 0 and have absolute value at most equal to 11. 

Concerning divisibility by 4, we indicate a result of Pierre (1943), 
preceded by a lemma on Jacobi symbols. 


LEMMA 2.1. Let a,b,c be pairwise relatively prime odd integers such 


(2)-(2)-(g)-# 


Then at most one of the numbers a,b,c is congruent to 3 modulo 4. 


Proor. Assume that a = b = 3 (mod 4). Then by the reciprocity 
law for Jacobi’s symbol: 


al ae cy alo) 
a glo. 

Gaol 

Hence (a/b) = (a/c) = (b/c) = (b/a). However 


(8) = (G)icumamenn a) 


which is a contradiction. 


1 


(2E) Let p be an odd prime number, and let x,y,z be nonzero 
relatively prime integers such that x? + y? + z? =0. Then 4 divides 
one of the numbers x, y, z. 
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PROOF. We may assume z even, while x, y are odd; we also assume 
z =2 (mod 4) to derive a contradiction. 


First Case: p xyz. 


By (2.1), 
Dp P 
oS B+ & 
Z+ux 
= 2P-h — gP Pg 4... $ z2gP 3 — gph? 4 zPol 
= —27?-24 2°"! (mod 4). 


—2(4k + 1)?-* +1 = #241 = 3 (mod 4), and therefore 5, 
(mod 4). Similarly, r? = (2? + y?)/(z+ y) = 3 (mod 4), so ry 
(mod 4). 

By Chapter II, (3A), (71 /sit1) = (s1/riti) = (t:/s171) = 1, hence 
by Lemma 2.1 at most one of the numbers r,, s;,t,; may be congruent 
to 3 modulo 4, which is a contradiction. 


Since x is odd then x?~' = 1 (mod 4) hence s? = —2x?-? +1 


Second Case: p| xyz. 

First we consider the case when p | z. As in the first case, s; = 3 
(mod 4), r; = 3 (mod 4). 

By Chapter III, (2A), the odd integers r,, 51, pt; satisfy (1, /pt,s1) 
= (s,/ptir:) = (pti/ris1) =1. This contradicts Lemma 2.2. 

Now we assume that p J) z and, for example, p | x. Proceeding 
as before, by (1.2) we have pr? = 3 (mod 4) and since r; is odd 
then r?~' = 1 (mod 4), so pr; = 3 (mod 4). Moreover, s; = 3 
(mod 4) and by Chapter III, (2A), the odd integers pr,, 51, t; satisfy 
(pri /siti) = (81/priti) = (ti/prisi) = +1, and this contradicts 
Lemma 2.1. 


In 1910, Lind claimed that 9 divides x + y+ z, but his proof 
was insufficient. As a consequence there are several inequalities and 
equations in his paper which are questionable (see Dickson, 1920, p. 
769). 

After these divisibility results by small numbers, we turn our at- 
tention to divisibility results by expressions built from the numbers 
x,y,z which are hypothetical solutions of Fermat’s equation. 

In 1913, Niewiadomski showed: 
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(2F) Let p be an odd prime number, and let x,y,z be nonzero 
relatively prime integers, such that x? + y? + 2? =0. Then x?P+1 + 
y2Ptt 4 22P+l is divisible by (x + y)(yt z)(z4+2). 


PROOF. We have 


prety yprtt 4 got = (SAY ee **) ‘ 
xP + yP  yP+ zP 0 ZP + xP 


as may be easily verified. Hence 


BP yee YP Ae) gP ae). (oP ey) ee 2?) 
(c+ y)\(yt+z)(z+ 2) (y+z)(z+2) (z+ y)(z+2) 
(a? + y?)(y? + 2?) 
(z+ y)(yt 2) 


and this number is an integer. 


The following result of Rameswar Rao (1969) is also very simple: 


(2G) [fp is an odd prime number, if x,y,z are nonzero pairwise 
relatively prime integers such that x? +y?+z? =0, then x+y divides 
d? where d = gcd(a+ y, z) (similar statements hold by symmetry for 
rtz,y+2). 


PROOF. From z+ y+ z= 0 (mod p), there exists k #4 0 such that 
xc+y+z2= kp. Since p is odd then x + y = kp — z divides both 
xP + y? = —z? and (kp)? — z? so kp — z divides (kp)?. Since d = 
gcd(kp, z), we may write kp = ud, z = vd with gced(u,v) = 1. But 
u—v divides both u?d?~! and v?d?~! hence (u—v)d = x+y divides 
d?. 


The following result, which appeared in a paper by Simmons 
(1966) was attributed to G. Reis; the assertion (1) was proved again 
by Rollero (1981): 


(2H) Let p be an odd prime number, and let x,y,z be pairwise 
relatively prime positive integers such that x? + y? = z?. Then there 
exist uniquely defined positive integers k,a,b such that x = k+a, y= 
k+b,z=k+a+t+b. 

Moreover: 
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(1) pab divides k?. 

(2) ged(a, b) = 1. 

(3) Ifa #1 then gced(k,a) £1; if p Va then gcd(k, a) 4 a. 
(4) Ifb #1 then gcd(k, b) 4 1; if p /b then gcd(k, b) F b. 


PROOF. We may assume 0 < 4 < y < z. Since z < 4+ y, let k be 
defined by x+y =2+hk. From y < z we havek <a < y. Let a,b 
be defined by x=k+a,y=k+b. Soz=k+artob. 

It is clear that k, a, b are uniquely defined: if 2 = k’+a’, y= k’+0’, 
and z= k’+a’+0' then 0 = (k—k’)+ (a-a’) = (k-F’) + (8-0) = 
(k—k’)+ (a-a@’)+ (6-0) sok=k',a=a',b=0'. We have 


(2.1) (k +a)? +(k +b)? =(ktatby). 


It is easily seen that gcd(k + a,k + b,k + a+b) = 1 because 
gcd(x,y,z) =1. Therefore k+a, k+b, k+a+b are pairwise relatively 
prime. 

Since (k+a+b)? = (k+a)?+p(k+a)?—'b+---+0? then (k+6)? = 
p(k + a)®-1b +--+ 0” hence k? + pk?-"b + (2)k?-7b? +--+ bP = 
p(k + a)?-'b+-+++ 6? so 


bP = pol(k +a)? — 4 ) ((hk+ a)? — PB 


wo ( - Jur 
p-l 


Each bracketed expression is a multiple of a, thus k? is a multiple of 
pab. Ifa 4 1 then gcd(k, a) £1, if b # 1 then gced(k, b) 4 1. We have 
gcd(a,b) = 1 for if a prime q divides a,b then it would also divide k, 
hence & +a, and k +0. 

If p a then gcd(k, a) € a otherwise k = al, | an integer, and from 
(2.2) we have, after dividing by a, 


a? = pha? *[(1 + 1-1 — P+ () Pelle Pe 
tee. + pbPt, 


Hence a divides pb?-'. But p a hence a divides b?~', a contradic- 
tion. Similarly, if p /b then gcd(k, b) 4 b. 
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Let p be an odd prime and assume that x, y, z are nonzero pairwise 
relatively prime integers such that x? + y? + z? = 0. 

Fleck (1909, 1910) began a more systematic study of divisibil- 
ity properties of the following numbers (built from the hypothetical 
solution (x, y, Z)): 


A=ytyz+2’, 
B= 2t+zr+2', 
C= a t+ayty’, 


A, = 2? — yz, Ap = 27+ yz, 

B, = yy’ — 22, By = y? + 22, 

C, = 27- xy, Cy = 27+ 2y, 
S=atytz, T= —-(xyt+yz+ 22), 
U = xyz, QV = a? ty? 4 27. 


Let r,s,t be defined as Barlow relations (Chapter III, §1). 


(21) With the above hypotheses and notations, there exist nonzero 
integers G, M, J, K, L, J), Ky, 11 such that 


(1) 


ge { —rstp’GM (in the first case), 


—rstp°GM (in the second case). 


(2) A — GJ,B — GK,C a GL, A, — GJ,, By — Gky,C, = 
GIy. 

(3) G is the greatest common divisor of S and the six expressions 
above. 

(4) J, K, L, Ji, K,, Ly are pairwise relatively prime. 

(5) The prime factors of J, K, L are of the form 6hp +1. 

(6) The prime factors of J,, K,, L, are of the form 6hp? +1. 

(7) 2? = y*? = 2°? (mod GJKLJ,K,1). 


4 


In 1979, Inkeri gave corrections to some proofs of Fleck and ob- 
tained further results along the same lines. 

We conclude the section with a result of Pollaczek, obtained in 
1917. No elementary proof is known for it. Pollaczek’s proof was 
based on congruences obtained by Kummer in 1857, which should 
hold if the first case of Fermat’s theorem is assumed false for the 
exponent p. Kummer’s proof of these congruences involves a detailed 


194 VI. Arithmetic Restrictions 


consideration of arithmetical properties of the cyclotomic field Q(¢,), 
where ¢, is a primitive pth root of 1. It is therefore not included in 
this book. 

Here is Pollaczek’s result: 


(2J) 


Let p be an odd prime number and assume that there exist 


pairwise relatively prime nonzero integers x,y,z, x? + y? +2? =0. 
Then A=y?+yz+27, B= 22+224+ 27, andC=272+2y+y? are 
not divisible by p. 


1857 


1909 


1910 


1910 


1913 


1917 


1920 


1931 


1931 


1934 


1934 


1943 
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VI.3. Abel’s Conjecture 


Abel stated in 1823 that if x, y, z are nonzero relatively prime integers 
such that 0 < 4 < y < z and 2”+ y” = 2” (n > 2), then none of 
X,Y,z are prime-powers. No direct proof of this statement has ever 
been discovered. However, we shall see that it is correct when n is 
not a prime number or when n = p is a prime not dividing xyz. 
This last assertion will not be proved in this book, since it requires 
analytical methods. 

Many partial results, obtained by various authors, are summarized 
in Table 7. 

In 1887, Mansion claimed to have shown that if the exponent is 
an odd prime that x is not a prime. His proof was erroneous. In 
1891, Lucas published a proof that if n is arbitrary then x cannot 
be a prime-power; but his proof was incomplete, as pointed out by 
Markoff in 1895. In 1955, Moller established a theorem containing 
all the above results. However, the proof on page 27 of his paper 
was insufficient. We give below a simpler and correct proof: 


(3A) Letn>3 be an odd integer with r distinct prime factors. If 
1l<a<_y are relatively prime integers anda = y" +2", b= y"—2x", 
then: 


(1) a and b have at least r distinct prime factors; 
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(2) Ifa has exactly r distinct prime factors then a = 23 + 1° (so 
fh] 3, = Wand 
(3) If b has exactly r distinct prime factors then y= x+1,b= 


(a+1)"— 2". 
TABLE 7 
Year Author Exponent n >2_ Result 
1857 Talbot arbitrary (I) y, z are not primes 
(II) if x is a prime then 
z-y=1 
1884 Jonquiéres arbitrary (I) and (II) 
1887 Borletti odd prime z is not a prime 
even x,y,z are not primes 
1901 Gambioli arbitrary (II) z not a prime-power 
not a power of 2 
1905 Sauer arbitrary y, Z are not prime-power 
1932 Mileikowsky arbitrary y, Z are not prime-power 
not a prime (III) x not a prime-power 
1949  Izvekoff arbitrary (I) 
1952 Bini odd z not a prime-power nor 
equal to nq, q prime, 4 n 
odd prime z is not a multiple 
of n, z is not equal to 
q192°°* qr where q 
< qo < +++ <q, are prime 
and gq? > 2q192°+- Qr- 
ProoFr. (1) Let p1,..., p, be the distinct prime factors of n. Since 


yx # 0, by Chapter II, (8B)(3), y” £2" = (y+ 2) - Qrly, F2) 
is a multiple of (y+ x) []j_, Q,,(y, #2). By the same result, Part 
(2), the integers Q,,(y, Fx) (for i=1,...,1r) are pairwise relatively 
prime. 

We observe that 2 + y* > x+y and (2 — y™)/(4 —y) = 
gir} 4 gPim2y 4... + yPi-1 > 1. So each Q,,(y, FX) has at least one 
prime factor. Hence, a,b have at least r distinct prime factors. 

(2) Let m be the product of the r distinct prime factors of 
n, som |n. First we show that n = m. We have a = (y™ + 
x™)(y” + 2”) /(y™ + 2”). By (1) and the hypothesis, y” + 2” has 
exactly r distinct prime factors, which are the same as those of a. If 
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n > m then a has no primitive factors. By Chapter II, (3J), we have 
a = 22+ 13, so n =3 =™M, which is a contradiction. Thus n = m, 
i.e., n is the product of r distinct primes. 

Let r=1,n=p,soa=(y+2)Q,(y, —x). By hypothesis, a is 
the power of some prime g. Since q divides y+ and Q,(y, —x) > 1 
then by Chapter II, (3B)(4), ¢ = p; by Part (6), Q,(y, —z) = p. By 
Chapter II, (3D), then p = 3, y= 2, 4 = 1. 

Let r > 1, n = ph, p being a prime, p/h; so h is the product of 
r — 1 distinct primes. From a = (y" + x2")Q,(y”, —x”), by (1) and 
the hypothesis, y” + 2” has r —1 or r distinct prime factors. By 
induction on r, if y” +2” has exactly r — 1 distinct prime factors 
then =" 3, y= 2 = 1 So =" Sp with Sa), r= 2; and 
23 41 = (2? + 1)Q3(2?, -1). 

If there exists a prime g 4 3, such that q | 2? +1 then by Chapter 
II, (3B)(6), vg(Q3(2?,—-1)) = v,(3) = 0, so q  Q3(2?,—1). Since 
r = 2 and 3 | 2? 4+ 1 then Q3(2?,—1) is a power of 3; again by 
Chapter IT, (3B)(6), Q3(2?, —1) = 3, so by Chapter II, (83D), 2? = 2, 
which is impossible. 

If 2? + 1 = 3° then 3° = 1 (mod 8), because p > 3. Thus s = 2s’. 
So. 2? = 3° il = (87-41) (6% =a) hence3* 19 a 1 ae 
with p—c > c > 0. Taking the difference 2 = 2?~°—2° = 2°(2?-*°—1), 
soc=1, p—2c=0, i.e. p = 2, which is impossible. 

If y” + 2” has exactly r distinct prime factors, then each prime 
factor q of Q,(y",-x") divides y” + 2". By Chapter II, (3B)(4), 
then g = p; Q,(y", —x") > 1, therefore it is a power of p; by Part (6) 
of the same proposition, Q,(y",—x”) = p and by Chapter II, (3D), 
p = 3, y” =2, x*® = 1,80 h=1, n =3, concluding the proof. 

(3) Ifb=y"—2” has exactly r distinct prime factors, by Chapter 
II, (8B)(2) and (3), y”—2” is a multiple of (y—2) [T;_, Qp,(y, x) and 
the integers Q,,(y, xz) > 1 are pairwise relatively prime, hence by the 
hypothesis, they are prime powers. If y— ax > 1 and q is a prime 
dividing y— x then there exists i such that q¢ | Qp,(y, x). By Part (4) 
of the same proposition, g = p; and by Part (6), Q,,(y,v) = p;. By 
Chapter I, (3D), p; = 3, y = 2, 2 = —1, which is a contradiction. 
This proves that y=a+1. 


As a corollary, we have: 


(3B) Letn>3 be a positive integer having r distinct odd prime 
factors. If[0 <a <y < z are relatively prime integers such that 
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xr’ +y" = 2", then z,y have at least r+1 distinct prime factors and 
x has at least r distinct prime factors. Moreover, if x has only r 
such factors then n is odd and z—y = 1. 


PROOF. By Chapter I, n is not a multiple of 3, nor a power of 2, so 
r>1. Let n= 2"m, with u > 0 and m odd, having r distinct prime 
factors. Let 2; = 2? ,y, = y* , 21 = 27. Since 2° = 2? =a +y™ 
and y” = yf’ = 27" — yf" and since m # 3 and z, — x, > 1, it follows 
from (3A) that 2",y", hence also z,y, have at least r+ 1 distinct 
prime factors. 

Similarly «” = x27 = zy’ — yy", so x”, hence also x, has at least 
r distinct prime factors. If x has only r such factors then by (3A), 
2=y+1,ie, 27 =y? +1. Sou=0,z=y4+1 and n odd. 


More explicitly: 


(3C) Letn>2, andlet0<u<y< z be relatively prime integers 
such that x” + y” =z". Then: 


(1) y,z are not prime powers; and 
(2) if x is a prime power then z= y+1 and n is an odd prime. 


Proor. (1) If z or y is a prime power, so is 2” = 2" + y", or 
y” = 2" —«”. By (3B), n is a power of 2,n > 4, and this contradicts 
Fermat’s theorem, which is true for such exponents. 

(2) If a is a power of a prime g then by (3B), z = y+ 1 and 
n= p*,e>1, pan odd prime. We show that e = 1. 

ee Sl then since 2?—y? > Land Qpe- 1(2P, yP) = (2? —yP) /(2?— 
y?) > 1, it follows from x? = 2?° — yP° = (2? — yP) - Qye-1(2?, y?) 
that Botli factors in the right-hand side are powers of q, greater 
than 1, hence multiples of g. By Chapter II, (3B)(4), q divides 
ged(z? — y?, Qpe-1(2”, y?)) = gcd(p**, z? — y?), hence q = p. 

On the other hand, since z = y+ 1 then z? — y? = py?! + 
Cyr e+e + (vy +1, s0 p=q does not divide z? — y?, which is 
a contradiction. 


Moller proved the following complement to (3A): 


(3D) Ifm is odd, m > 3, with r distinct prime factors, if0 <xa<y 
are relatively prime integers, then b = y?™ — x?” has at least 2r +1 
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distinct prime factors. 


PrRooF. b = (y” — 2™”)(y™ + 2™) and since gcd(z, y) = 1 then d = 
gcd(y™ — a™,y™ +2”) = lor 2. By (3A), y” —x2™ and y™ + «™ 
have at least r distinct prime factors. Since m > 3, by (3A), y™+2 
has at least r+ 1 distinct prime factors. 

If y” —a”™ has exactly r distinct prime factors, by (3A), y= x+1, 
thus y™” — x” is odd. Hence d= 1 and 6 has at least 2r + 1 distinct 
prime factors. If y™ — x” has at least r +1 distinct prime factors, 
since d = 1 or 2, then b has at least 2(r + 1) — 1 = 2r 4+ 1 distinct 
prime factors. 


In particular, if m is odd, m > 3 with r distinct prime factors, if 
0<a<y< zarerelatively prime integers such that 7?”"+y?™ = 22” 
then x, y have at least 2r +1 distinct prime factors. This was shown 
by Moller in 1955. Combining with (3B) it follows, with the above 
exponent 2m, that x,y,z cannot be prime powers. 

Inkeri showed in 1946 that if0 <a < y < z,ifpisa prime number, 
p/axyz and x? + y? = 2”, then z—y > 1. Hence by (3C), z is not 
a prime power. This provided a direct proof of Abel’s conjecture in 
the case when p / xyz. No direct proof is known, when p | xyz, that 
x is not a prime power, so no direct proof that z > y+ 1 has been 
devised. 

In 1886, Catalan examined the implications of this eventuality. 


(3E) Let p be an odd prime number, and let 0 < x < y be integers 
such that x? + y? = (y+1)?. Then: 

(1) py(y +1) divides x? — 1. 

(2) py, pl a—1. 

(3) If q is a prime dividing y+1-—<«x then q divides x —1. 

(4) ged(x+y, y+t1—2)=1. 

(5) ged(2x — 1, 2y+1) = 1. 

(6) x is the only integer such that 


(py)? <x < (p(y+ 1)”. 


ProoFr. (1) The polynomial (Y +1)?—Y?—1 is a multiple of p, Y, 
and Y +1, hence x? —1 = (y+1)?—y?—1 = py(y+1)h, h an integer. 
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(2) Since p| 2? —1 then px. From 2? = x (mod p) it follows 
that p|x—1. 

(3) y? = (y+ 1)? — a2? = (y+1-—~2)k, where k is an integer; 
hence if g| y+1—<2 then q|y,soq|a—-1. 

(4) If q is a prime dividing x + y and y+1—<2 theng|a-1 
so q | y, hence q | x; but from x? + y? = (y+ 1)? it follows that 
gcd(x,y) = 1, a contradiction. 

(5) If q is a prime dividing 2x — 1, 2y + 1, then it divides their 
sum 2(a + y) and their difference 2(y — x + 1); but q is odd, so 
q|y—x+1, q|a+1, which is a contradiction. 

(6) We have 


(Ela 
(y+)—y 
=e Ger dP yi eretlyse Dyke EP 


sillier EenelS me ao 


hence 
pes ae py se Ps 
and this yields the inequalities of the statement. 
Finally, we note that if 7,, x2 are integers such that 


- eand 
(py? 1)? < ay < ao < (p(y +1)P2)” 
then 
1/(p-1) 1/(p-1) 
x x 
yY< ay (*) and Lo (=) <yt+l, 
hence 
1/(p-1) 1/(p-1) 1/(p-1) 
x x x 
Ge —_ x1) (=) < v2 (=) — (=) < 1. 
Pp p Pp 


But 1 < x2 — a, and p = (p?)'/” < (py?-1)/P < a, since by (2), 
p<a—-1<a<y. Therefore 1 < (2/p)/®-) and so 1 < (x2 — 
2)(2/p)'/@-), which is a contradiction. 


We now shall use Barlow’s relations of Chapter III, §1, with an 
obvious change of notation, since we consider the relation x?+y? = z? 
(with 0 < x < y < z) instead of x? + y? + z? = 0. In particular, we 
use the integers 7, s,¢ defined in those formulas. 

In 1964, Dittmann proved the following facts: 
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(3F) Let p be an odd prime, and let 0 < x < y be integers such 
that x? + yP = (y+1)? (hence x,y are relatively prime). Then: 


(1) fp| yt then v,(—s + 1) = v,(y +1) -1. 
(2) Ifp| y then v,(t — 1) = v,(y) — 1. 


Proor. (1) Let z= y+1 and z—2 = —s?, so s 4 —1. By Chapter 
III, (1C), n = v,(y +1) > 2 and p” divides z and z+ y. So p” 
divides 2z — (a+ y) = z-—x+1 = —s?+41. Since s? = s (mod p) 
then p | —s+1. Let v,(—s+1) =1> 1. By Chapter II, (3B)(6), we 
have n < v,(—s? +1) =14+1,sol>n-1. 

If v,(—s+1) > n then by the fact just quoted above, v,(—s?+1) > 
n+ 1, hence p”*! divides —s? + 1 = 2z — (x+y). By (1.2), p?”-! 
divides x+y, hence p"*! divides 2z, so p"*' | z = y+1, contrary to 
the hypothesis. This shows that v,(—s +1)=n-—1. 

(2) Let v,(y) =n, so by Chapter II, (3B), n > 2 and by (1.2), p” 
divides y and y+1—ax = 2y—(x+y-—1). So p” divides x+-y—1 = t?-1. 
Since t? =t (mod p) then p divides t—1. Let v,(t—1) =1 > 1; then 
by Chapter II, (3B)(6), n < v,(t? —1) =141, hence 1 >n-—-1. 

If v,(t — 1) > n then v,(t? — 1) > n+1 0 p"*! divides t? —1 = 
2y —(y+1— <2); but p?”~' divides z — 2 = y+1-—2, therefore p”*! 
divides 2y, so p"*! | y, which is a contradiction. 


The situation covered in the preceding result cannot yet be ruled 
out by a direct proof. With other methods, it may be shown that 
under the hypothesis of (3E), necessarily p divides y or y+ 1. 

In his thesis, Dittmann has also studied the possibility of a solution 
of Fermat’s equation, with y = «+ 1. He showed: 


(3G) If p is an odd prime number, if there exist positive integers 
0 <a < z such that x? + (2+ 1)? = 2”, then: 

(1) p| a(a@+4+1). 

(2) Ifp|a+1 then —r = [—p"-/?s] and r =1 (mod p). 

(3) If p| x then —s = [—p"-/?r] + 1 and —s =1 (mod p). 


ProoFr. (1) Ifp)a2(#+1), by Chapter III, (1C), z-— «= —s? and 
z—(#+1) =-—r? so 1 =r? — 8, which is impossible, since r,s are 
not zero. 

(2) Ifp|a+1 then p/ xz, so by Chapter III, (1C), z-2 = 
—pP?-1sP z — (a +1) = —r?; hence 1 = r? — p??~'s?, sor = 1 
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(mod p) and —r? < —p?"-1sP, so —r < —p™—1/Pg, 
If —r < —p"-/?s—1 then r > p®—/?s +1, hence r? > pP?-'s+1, 
which is a contradiction. This shows that —r = [—p"—1/?s]. 


(3) 


If p | « then p (a+ 1)z, so by Chapter III, (1C), z -—2 = 


—s?, z—(x+1) = —pProtrP; hence 1 = p??—!r? — s?, so —s = 1 
(mod p) and —s? > —p?"-1rP, hence —s > —p"—1/Pr, 
If —s > —p*-/Pr +1 then s+1 < p”-/?r, hence s? +1 < p??—1p?, 


which is a contradiction. This shows that —s = [—p"~!/?r] + 1. 
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VIL4. The First Case for Even Exponents 


In this section, we give Terjanian’s proof (1977; see also 1978) that 
the first case of Fermat’s theorem holds for even exponents. It suffices 
to consider exponents 2p, where p is an odd prime. The proof will 
require only elementary considerations, so it is surprising that it was 
not found beforehand. 

Several authors had considered Fermat’s equation with even ex- 
ponents, however their direct proofs are now all superseded by Ter- 
janian’s; yet we shall quote, and in a few cases prove, some of these 
statements. 

To begin, in a letter to Gauss (1804), Sophie Germain stated with- 
out proof that if p is a prime, p = 7 (mod 8), then X?-!+ Y?-! = 
Z”~' has no solution in nonzero integers. 

In the first of a long series of papers on Fermat’s theorem, Kum- 
mer proved (1837) the following result, which was rediscovered by 
Niedermeier in 1943 and again by Griselle (1953) and Oeconomou 
(1956). We follow Griselle’s proof: 


(4A) Letn > 2 be an integer. If there exist nonzero integers x, y, z 
such that x?” + y?” = 2?” and gcd(n, xyz) = 1, thenn =1 (mod 8). 


PROOF. By Fermat’s theorem for the exponent 4, we may take n 
odd, n > 3. We may also assume that x,y, z are pairwise relatively 
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prime positive integers, and that x is even, while y, z are odd, by the 
remark at the beginning of Chapter I, §1. Then 


en = yen — yr” 


H (22 — y2) (22D 4 y2z22) pp y2l0-2) 22 4 y(n) 
= (2 —y")- Qn(2",y"), 
with the notation of §1. By Chapter II, (3B)(4), 
ged(z” — y”, Qn(z*, y*)) = ged(z? — y*, n). 


If p is a prime dividing n and z? — y? then a fortiori p divides z, 
contrary to the hypothesis. Thus z?—y? and Q,,(z”, y”) are relatively 
prime. 

So z* — y? and Q,,(z?,y”) are 2nth powers. Moreover, 2” — y? is 
even so Q,,(z”,y”) is odd. Thus there exists an odd integer k such 
that 


g2ln-l) as grln—2)ay2 a gina Be koan he ge} = ke”. 


Each term of the above equality is an odd square, so it is of the form 
(2a+1)? = 4a(a+1)+1=1 (mod 8). Thus, we have the congruence 
n= 1 (mod 8). 


From this result, it follows: 


(4B) The set of primes p such that the first case of Fermat’s 
theorem holds for the exponent 2p is an infinite set. 


PrRooF. According to Dirichlet’s theorem for primes in arithmetic 
progressions, there exist infinitely many primes p satisfying each of 
the congruences p = 3 (mod 8), p=5 (mod 8), p=7 (mod 8). For 
each such prime p the first case of Fermat’s theorem holds for the 
exponent 2p, by virtue of (4A). 


We note the following strengthening of (4A); statement (1) is due 
to Niedermeier (1944), while (2) was given by Grey (1954): 


(4C) Let p be an odd prime and assume that there exist nonzero 
relatively prime integers x,y,z such that x7? + y?P = 27? : 


(1) If 3p does not divide x,y,z then p=1 (mod 3). 
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(2) If 2p does not divide x,y,z then p = 24a+1 (for some inte- 
ger a) and 12a+1 has no factor congruent to 3 modulo 4. 


Another improvement over (4A) is the following ((1) was proved 
by Niedermeier in 1944, while (2) was given by Long in 1960): 


(4D) Letm> 3 be an integer and assume that there exist nonzero 
integers x,y,z such that 2?" + y?” = 27”: 
(1) a m =p is a prime and if 5p does not divide x,y,z then 
=+1 (mod 5). 
(2) if gcd(m, ryz) = 1 then m= 1 or 49 (mod 120). 


As a corollary: 


(4E) If n = 2m has last digit 4 or 6 (when written in deci- 
mal notation), then there exist no nonzero integers x,y,z such that 
gcd(m, xyz) = 1 and 2” + y” = 2”. 


PROOF. We have m = +2 (mod 5), hence m ¥ 1 or 49 (mod 120) 
and the result follows from (4D). 


Long (1961) extended his method and proved: 


(4F) [fp is a prime, p= —1 (mod 10) and if there exist relatively 
prime integers x,y,z, prime to p and such that x7? + y?? = z??, then 
p is a square modulo 11, and consequently p = 49, 169, 289, 529, or 
889 (mod 1320). 


Oeconomou proved in 1956 the following results, involving the 
Legendre symbol: 


(4G) Letn>1 be an odd integer and assume that there exists an 
odd prime q such that: 
(a) ged(q—1,n) = 1. 
(b) (n/q) = -1. 
(c) [f0<m< (q—1)/2, with n =m (mod (q—1)/2), if a,b, 
are integers such that a?” +b?” = c?” (mod q), and 


2 2 25 22 2 f2 
(**) =1, (¢ “) =1, (¢ =) =1, 
q q qd 
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then abc = 0 (mod q). 


Then there exist no relatively prime nonzero integers x,y, z such that 
gcd(n, ryz) = 1 and 22” + y? = 27”, 


There are numerous possible choices of g,m satisfying the condi- 
tions. For example: In any of the following cases, the first case of 
Fermat’s theorem holds for the exponent 2n: 

e) n =-—1 (mod 3). 
+2 (mod 5). 
(n/i1) = —l and n=4 (mod 5). 
(n/19) = —1 and n=4 or 7 (mod 9). 
(e) (n/23) = —1 and n = 2, or 3, or 4, or 10 (mod 11). 
(f) (n/29) = -1 and n =3, or 5, or 9, or 11 (mod 14) etc. .... 


In this way, Oeconomou proved that the first case of Fermat’s 
theorem holds for all even exponents less than 200 000 (with the 
possible exceptions of 108 722 and 188 018). 

Gandhi proved in 1966: 


(4H) Let p be a prime, p > 5. If x,y,z are pairwise relatively 
prime integers such that x?-! + y?-' = z?-! then z is odd, and p 
divides the one among the integers x,y which is even, hence p does 
not divide z. 


PROOF. Since p—1 is even, by the remark at the beginning of Chap- 
ter I, §1, z is odd and x,y have different parity, say x is odd, y is 
even. 

If p / zy then x?~' = y?-! = 1 (mod p) while z?~' = 0 (mod p) 
or = 1 (mod p). Thus 2 = x?~! + y?-! = 0 or 1 (mod p) which is 
impossible. Let us assume that p | x, p / y — this will lead to a 
contradiction. Let m = (p— 1)/2. We have 


aP 1 _ yp tae b= (2™—y™)\(2z™ + y™). 


Then the two factors in the right-hand side are relatively prime, since 
they are both odd. Therefore each factor is a (p — 1)th power: 


gm fo y™ = apt 

Bey er, 
hence x = ab and also 2z” = a?~'+b?-!. But p| x and gcd(a, b) = 1 
so p divides one and only one of the numbers a,b. Hence, 22” = 
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1 (mod p) and squaring this congruence, 4 = 1 (mod p), which is 
impossible. 
This proves that p divides y. Hence p does not divide z. 


In 1969, Raina proved: 


(41) Let p be a prime, p > 5. If x,y,z are positive pairwise 
relatively prime integers such that x?~' + yP-' = z?-! then z is a 
quadratic residue modulo p. 


PROOF. Only one of x, y is even, the other is odd. Suppose y is even. 
By (4H), z is odd and p divides y. 

Let m = (p—1)/2. From (2™)? + (y™)? = (2™)?, it follows by 
Chapter I, (1A), that there exist positive integers a,b, of different 
parity, such that gcd(a,b) = 1 and 


gm = a? = ae 
y™” = 2ab, 
zm =art+’. 


Suppose first that b is odd, so a is even. Since gcd(2a,b) = 1 then 
there exist integers h,k such that 


2a = h”™, 
oe 
Therefore 42™ = h?-'+4k?-'. Ifp| kthenp /hso4z™ =1 (mod p). 
Hence, squaring this congruence, 16 = 1 (mod p) so p = 5, which is 
not possible because the equation X*+ Y* = Z* has no solution in 
positive integers (Chapter I, (2C)). 
So pk and since p | y then p | h. Therefore h?~' = 4a? = 0 
(mod 16p) and z@-)/? = kP-! = 1 (mod p). 
Suppose now that 6b is even, while a is odd. Proceeding in the 
same way, we show that z~)/? = 1 (mod p). Thus z is a quadratic 
residue modulo p. 


The next result in Raina’s paper is vacuous since its hypothesis is 
never satisfied (namely, z cannot be a prime, by (3C)). 

In 1955, Becker published a paper in which he asserted that Fer- 
mat’s theorem is true for all even exponents 2m > 2. However, his 
proof is definitely wrong. 
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Now we shall give the proof of Terjanian’s theorem, which con- 
tains all the above results as corollaries. Once more, we shall consider 
the quotient Q,(z,y) = (2” — y")/(z —y) where n is an odd natu- 
ral number and z,y are nonzero distinct integers (not necessarily 
positive). 

For the convenience of the reader, we recall that if m,n are nonzero 
odd integers, n > 0, gcd(m,n) = 1, then the Jacobi symbol (m/n) 
is defined. If m > 0 then (n/m) is also defined, and the following 
reciprocity law is satisfied: 


(4.1) (=) = (-1)m-D/2x@—-1/2 (“) 


Moreover, 


(4.2) (=) = (—1)-0/2, 


(4J) Let y,z be distinct nonzero odd integers such that y = 
(mod 4) and ged(y,z) = 1. Let m,n be odd integers, m > 1,7 
1, gcd(m,n) = 1. Then: 
(1) Qn(z,y) =m (mod 4), and in particular Q(z, y) is odd. 
(2) The Jacobi symbols (Qm(z,y)/Qn(z,y)) and (m/n) are well 
defined and equal. 


Vix 


Proor. (1) Let z=y+4t. Then 


(y + 4t)* — y™ m a m sh 
wey) = “_— m m-27e 4... 
Oy Z9) a 1/)Y + 9 |Y + 


= my” '=™m (mod 4), 


because m — 1 is even, y is odd, so y~' = 1 (mod 4). 

(2) First we note that since gcd(m,n) = 1 then the Jacobi 
symbol (m/n) is well defined. Similarly, from gced(y,z) = 1 and 
Chapter II, (3B)(2) it follows that gcd(Qm(z,y), Qn(z,y)) = 1; since 
Q(z, y) > 0 then the Jacobi symbol (Qm(z,y)/Qn(z, y)) is also well 
defined. 

The equality of the Jacobi symbols is proved by induction on k = 
min{m,n}. It is trivial when k = 1. Let k > 1, som ¥# n, because 
gcd(m,n) = 1. 
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If m > n then there exist an odd integer r,0 <<r<n=k, and an 
integer q such that m = qn+r, or m=qn-r. If m=qn+r then 
m —r is even, so by Chapter II, (3B)(1), 


(Geen) 2 (eee) = (3424) = (<) = (=) 
Qn(2,¥) Qn(2,Y) Qn(2,y) n ni} 
If m = qn—r then m—7n and n—,r are even, so by Chapter II, 
(3B)(1), induction and the properties of the Jacobi symbol, 


(SER) - Cea) 


= (aie) = een) (Gem) 
7 Gees, (5). 
Since Q,,(z, y) =n (mod 4) by Part (1), then (Q,(z,y) —1)/2= 
(n — 1)/2 (mod 2), hence by (5.2), (—1/Qn(z, y)) = (—1/n). Thus 


Be aaa) 


Now, if m < n, by the reciprocity law (4.1) for the Jacobi symbol 
and the above proof, 


(GE) = ayn (Ga 


= (=1)0"=D/2x(n—1)/2 (“) _ (=). 
m n 


Terjanian’s result now follows very easily: 


(4K) Let p be an odd prime. If x,y,z are nonzero integers such 
that x2? + y?? = z?? then 2p divides x or y. 


PRooF. There is no loss of generality to assume that 2, y, z are pair- 
wise relatively prime. Also, x,y cannot be both odd, since the expo- 
nent 2p is even (see remark at the beginning of Chapter I, §1). Let 
x be even, so y,z are odd. Then 


a2P = 22P =? y? = (ez 2 y’) 
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By Chapter II, (3B)(4), 


2p 2p 
2° —Y 


ged (2? —y )=p or l. 


? 22 — y? 
If the greatest common divisor is p then p divides x7? so 2p divides 
L. 

We show now that it is not possible that z? — y? and (27? — 
y’?) /(z2 —y?) be relatively prime. If they are, they must be squares. 
But 


gp — yP ze —yP . 2+y? 
Zz? — y? zZ-y z+ty 


= Q(z, y) x Q,(z, —y) 


and these two factors are relatively prime because gcd(y, z) = 1 and 
y and z are odd. So Q,(z,y), Q,(z, —y) are also squares. Since p is 
not a square, there exists an odd prime gq such that p is not a square 
modulo q. 

Assume first that z = y (mod 4). By (4.1), 


(GE) 
q Qqlz,y)/’ 
which is an absurdity, because Q,(z,y) is a square. If z # y (mod 4) 
then z = —y (mod 4), hence again 


(0)-($=9) 


which is again an absurdity. This concludes the proof. 


In 1981, Rotkiewicz showed the following strengthening of Terja- 
nian’s result: 


(AL) Let p be an odd prime. If x,y,z are positive integers such 
that x7? + y?? = z??, then 8p® divides x or y. 


PROOF. We may assume that x,y,z are pairwise relatively prime 
and that x is even, while y, z are odd. Then by Terjanian’s theorem, 
2p divides x. 

Now we show that 8 divides x. We have gced(z? — x?, z? +. x?) =1 
because gcd(a, z) = 1, x is even and z is odd. Since y?? = 2°? —a@?? = 
(2? — xP)(z? + x?) then 2? — x? = [(z? — x?)/(z—2)|(z —2) = a”, 
where a is an odd positive integer. Since p| a then pz sop)z— a. 
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By Chapter II, (3B)(5), ged ((z? — x?)/(z —ax), zx) = 1, hence 
(2? — a?)/(z — x) = b’, with b an odd positive integer (because z 
is odd). Hence 6b? = (2b, + 1)? = 46,(b; + 1) = 1 (mod 8) and 
gP-1 4 gp-24 4+ zP-3y7? = 1 (mod 8), because z is even. But z being 
odd, we have again z?~' = 1 (mod 8), so z?~°a(z+2a) =0 (mod 8). 
Hence y = 0 (mod 8), because z, z + x are odd. 

The proof that p? divides x follows from the result of Vandiver, 
(1B). We have also z? + x? = c??, where c is an odd positive in- 
teger. By Vandiver’s theorem we have x? = x (mod p?). Since 
p|ax and p> 3 then p? divides x, showing that 8p? divides x. 


In 1950, Gut adapted ideas of Kummer and Mirimanoff and used 
methods of class field theory to derive a criterion, involving Euler 
numbers, for the first case of Fermat’s theorem with exponent 2p. 
Of course, this result is now obsolete. 

In conclusion we mention explicitly the following easy fact, which 
will be used later: 


(4M) If x,y,z are nonzero relatively prime integer, n > 3 and 
gen + 2" = 2?" then 2| zy and 3| zy. 


ProoF. If x,y are odd then x?" + y?" = 2 (mod 4); now z must be 
even and 22” = 0 (mod 4), which is absurd. If 3 / xy then 2?" = 
y°" = 1 (mod 8) but 2?” = 0 or 1 (mod 3), so x7" +y?" = 2?” is not 
true. 
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Vil 
Interludes 7 and 8 


This chapter deals with polynomials which are intimately related 
with Fermat’s equation. 


VII.1. Some Relevant Polynomial Identities 


We give here some algebraic identities which are applicable in the 
study of Fermat’s equation. Many of the early attempts to prove 
Fermat’s theorem were based on some polynomial identities. 

To begin, we indicate the following identity, which was used by 
Lamé in 1840; see also Lebesgue (1847), Mention (1847), and Catalan 
(1885); Gauss (1863) gave the special cases when n = 3,5, 7; see also 
Rebout (1877), and Brocard (1878): 


(1A) If X,Y,Z are indeterminates and n is odd then 


(X4Y4Z)" —(X4Y—Z)" —(X-Y4Z)" —(—X4+Y4Z)" 
(n —1)! 


X%zy 2s Zk 
Qi+D!(2j +)! Qk+ 1)! 


=4nxXYZ SO 


i+jtk=(n—3)/2 
i,5,k>0 
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PRooF. By writing explicitly the nth powers of the left-hand side, 
we have: 
(X+Y+Z)" —(X+Y-Z)" — (X-Y4+Z)" —-(-X4+Y+4+Z)" 
! 
= {xv’z'(l - (-1)° - (-1) - (-1)"1. 
a! b! c! 


a+b+c=n 
a,b,c>0 


Since n is odd and a+6+c=n then one or three of the integers 
a, b,c are odd. If only one is odd then 1 — (—1)*—(—1)’—(—1)° = 0. 
Thus, we have to consider only the summands with a = 21+ 1, b= 
2j7+1,c=2k+1, soi+j+k = (n—3)/2 and 1—(-1)*—(-1)? 
(—1)° = 4. We deduce at once the identity of the statement. 


As a corollary, we have (see Werebrusow, 1908): 


(1B) If X,Y,Z are indeterminates and n is odd then 


4 
(RAVE Z =X Steg an (X+¥)(¥+Z)(Z+X) 


(n—1I(X +Y)*(¥ + Z)'4(Z +X) 
(i+ 1)!(27+1! (2k +1)! 


x 


itjtk=(n—3) /2 
i,j, k>0 


PROOF. We write the identity of (1A) with U,V,W in place of 
X,Y,Z, respectively, where U = (X + Y)/2,V =(Y 4+ Z)/2,W = 
(Z+ X)/2. Then 
ULV Wwe xXAV SS 
PivVow =v 
U-V+W =X, 
Spayew =z, 
hence (1A) becomes 
(RE GS Xa = 79 
4 
=F (X+Y)(V 4+ 2)\(Z+X) 


(n—1)1(X+Y)*(¥+Z)74(Z+X)** 
(21+ 1)!(27 +1)!(2k +1)! 


x 


i+jtk=(n—3)/2 
4,5,K20 
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A special case is the following: 


(1C) If X,Y are indeterminates and n is odd then 


(Kirsh 


4 
=> (X+Y)XY 


(nt)! ; 21125 y2k 
eS Gilereerrta ee 


i+jtk=(n—3)/2 
i,j,k>0 


PRooF. It suffices to replace Z by 0 in the identity (1B). O 


Already in 1837, Kummer used the following identity for (X + 
Y)” —(X"+Y™), see also Mention (1847), Vachette (1861), Barisien 
(1906), Boutin and Gonzalez Quijano (1907), Bini (1907), Rose 
(1907), and Bachmann (1910): 


(1D) Jf X,Y are indeterminates and n > 1 then 


(X+YV)"—(X"+Y") = ee (" A a ‘)x Y'(X+Y)""* 


i=l 


(by convention, the terms in the above sum are zero when 2i > n). 


PROOF. We show by induction that the identity 


i nfn—-i- 
Key = (OC EY) + SDE at ‘aura yy - 
t=1 


is true. For n = 1,2 it is trivial. Then 


xn + yr = (xe As Y")\(X + Y) = NV a res) 
= (Xep yet 


an —i-l coh . 

fat xX’y? xX Y n+1—2% 
+c ; ye (", 4 (X+Y) 
_xY(x Hyjrot 


Se yin (" t— *) xy) 1-27 
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= (X+F YY) 4S 0(-1'e XY X + YY) 


i=l 


where c, = n+ 1 and if 7 > 2, then 
7 n—-i-l anes n—-i-1 _ n+l n—-41 
SE at Sin 3S a ay 


In 1885 (and again in 1886) Catalan indicated another form of the 
identity (1B): 


(1E) If X,Y,Z are indeterminates and n is odd, then 


(XY EZ ae Sy e 


= pr-3 + H P’*+ 4H. Ppr-> 
(X+Y)(Y+Z)(Z+X) : . 


(2) 
+ er" + A,-3 + 2H (3) /2 


where P= X+Y+4+2Z = H,; more generally, if i > 1, H; is the sum 
of all monomials of degree i, coefficients 1, in the indeterminates 
X,Y,Z, and Te fs is the sum of all monomials of degree (n — 
3)/2, coefficients 1, in X?,Y?, Z? (so it has degree n — 3), that is, 
ys heaeee =| F(n—3)/2(X7, Y?, ZB?) 


PROOF. We have 


Pr _ (x” + yr + Zo) Pr pan Zr xX” + yr 


G1 = KAY aa ee oo 
= (Po 1+ ZP "74+ Z27?P Fe... 4+ Z"") 
—(X™ 7 - VX" 74 Y2XT 3S... + Y™?), 


But, by Euclidean division, since Y + Z = P— X: 


Prly Zpr? 4 Zpr-3 4... 4 Zr-t 
Y+Z 
= pr? + H,(X,Z)P"? + Ho(X, Z\Ppe" aes + H,_o(X, Z) 
ee ai, ©, Silas oie ce a 
Vag 
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where H;(X,Z) is the sum of all monomials of degree i, coefficient 
1, in the indeterminates X, Z. Therefore 


_ & 
Qs = YZ 
(P" + Hy(X,Z)P"* + Ha(X, Z)P"" + + + Hy 2X, Z)) 


tee + Z)X”? = es a VA, Gia + Oe + a \x 


+ 


ae ae 


Cr a) 
= (PP? + Hy (x, Z) PO + BX, Z) PO eb aX, Z)) 
+ (X"? —(Y —Z)X" 34 (¥Y? - ZV + Z°)xXr 4 — 
SP A ey si ey, 


By Euclidean division, 


PRO? oF CX) Bi Oa PP es ee, SCZ) 
BAX. 
= P® 34 A, Pp" *+ HP"? 4+.---+H, 3 
je EN s OD ae AW eee ee Ha(X, DVS Aes le FE (XZ) 
TAKS 


where H; is as indicated in the statement. Then 
Qe 


Q3 = 74x 
PP PP? abated A 


1 
+55” 24 A(X, Z)Y"? + A(X, Z)y"* 
+-.++ Hy, o(X,Z) +X"? —(Y —Z)x"™? 


+(¥°?-ZY+2Z7)x"*—- 


o] 


—(Y"?-Zy* 34 77y"4_...-Z*)], 
But 
Hi(X,Z) _, 
Z4+xX : 
wn 2 = X?4 27? = H)(X,Z), 
HOG) _ yt 4 x27? 4 74 = HX, 2), 


LX 
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etc. Hence 


Qs = Pes AP eh ee aye 


+H} (X, Z)¥"> + HY (X, ZV" + + HE 5) 9(X,Z) 
1 
Vo TRA EX 
Tae x! =I 2( ’ ) FE al ’ ) oe 


ae He SAV Ke a (ve axe 
+(¥°-Y¥Z4+277)x**- 
_ (yrs _ Zyr-3 + Z2ynu-A4 De ips Ze) 


The expression in the brackets is equal to 


BO, Sma we 1p Sica = fae an Ga a 
HY Z°4+V3Z)X" 64 (VZ4*4+YV9Z72+YV°)X T+ 
HY Z" 34 YV2Zr Oo 4.0.4 "47? +" 7) 
4X"? (Y —Z)X™ 34 (VY? -VYZ+Z*)x"4*- 
—(Y"? —Zy" 34 77y"4 —...—Z**) 

SKE RIX Da ee ZK 
HY* + Y°277? 4+ 24X84 (YZ 4-Y°22 4+ 7°)xX"" 
aba acaatale (Y" 8Z+Y" OZ +--+ Z%%) 

= X™?P4 ZX 34 APY, Z)X™ 4+ ae: ae 
+HP(Y, Z)X"-8 + HPV, Z)ZX"7 4+ + HE 


(Y, Z)Z. 


ae 3)/2 
Hence 
Qg= PP"? + AP * + BoP? 4 eer + Hyg + ee 
HY (X, Z)¥"-8 + HEX, Z)V"T $+ HO 5 9(X,Z) 
+$X3 4 HOY, Z)XP% 40 + AO 9 0(¥,Z) 


= Pr34 HP 4+ HyP > +---+ Hy3 + 2H 5/9, 


since each of the above last two lines is equal to H? 2 3)/2" 
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VII.2. The Cauchy Polynomials 


In his proof of Fermat’s theorem for the exponent 7, Lamé (1839, 
1840) made use of a polynomial identity of degree 7. In the analy- 
sis of Lamé’s paper, Cauchy and Liouville indicated a more general 
polynomial identity (1839), of which the following ones are special 
cases (Cauchy, 1841): 


(X+Y)?—X°-Y° = 5XY(X+Y)(X?+XYV+Y"), 
(X+Y)'—X™-Y" = 7XV(X+YV)(X?4+ XV 4+ Y")?. 


In this way the study of the polynomial (X + Y)” — X" — Y” was 
initiated. 

If n > 3, n odd, then this polynomial is a multiple of X,Y, X +Y; 
moreover, if n = p is an odd prime, then it is also a multiple of p. 


(2A) Letn =+1 (mod 6). The exact power of X?+ XY +Y? 
dividing (X +Y)" —(X"+Y™") has exponent 


_f 1 when n=-1 (mod 6), 
“1 2 when n=1 (mod 6). 


PRooF. We shall show that (X + 1)" — (X" +1) = (X?4X4 
1)°H,,(X) where e¢ is as indicated, H,,(X) € Z[X] and X?+X-+1 does 
not divide H,(X). By homogenization, we deduce the statement 
(2A). 
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Let G,(X) = (X +1)" — (X" +1) and let w = (-1+ V—3)/2 be 
a primitive cubic root of 1, hence w? +w+1= 0. Then G,(w) = 
(w +1)” — (Ww? +1) = -—(w?? +w" +1) = —(w?” — 1)/(w” — 1) = 0. 

So G,,(X) is a multiple of the minimal polynomial X? + X +1 of 
w. Explicitly, by division, G,(X) = F(X)(X?+ X +1) + (aX +b) 
where F(X) € Z[X] and a,b € Z. So0 = G,,(w) = F(w)(w? +w + 
1) + aw+b= aw + b, hence a = 0 (otherwise w = —b/a € Q, which 
is not true) and also b = 0. 

Next, (X* + X + 1)? divides G,,(X) if and only if w is a double 
root of G(X), that is, w is a root of the derivative G,'(X) = n((X + 
1)”-1 — X""1), ie, m[(w+1)""' —w""] = 0. Since w + 1 = —w?, 
this is equivalent to (w”~! — 1)w"~! = 0, that is, w"~! = 1. Finally, 
this holds if and only if 3 divides n — 1, son = 1 (mod 6). 

We show that (X? + X + 1)? does not divide G,,(X). If it did, 
then n = 1 (mod 6) and moreover X? + X +1 divides G,,"(X) = 
n(n—1)((X+1)""-?-X"-*). So G,"(w) = 0, hence (w+1)""? =w”? 
and —w("-?2) = w"-?_ Therefore w*-? = —1 and w*("-?) = 1. This 
implies that 3 divides 2(n — 2), so n = 2 (mod 3), hence n = —1 
(mod 6), which is a contradiction. 


We may therefore write: if m = +1 (mod 6) then 


(2.1) 
(X+Y)"—(X"+Y") = XY(X +Y)(X?74+ XY +Y’)°E, (X,Y), 


and if n = p> 3 is a prime then 


(2.2) 
(X+Y)? — (XP + Y?) = pXY(X+Y)(X?4+ XY +Y7)°C, (X,Y), 


where E,(X,Y) = pC,(X,Y) and e = 1, or 2, according to n = 
—1, or 1 (mod 6). 

There are numerous proofs of the above result (or variants of it) in 
the literature, to wit by Cayley (1878), Glaisher (1878, 1879), Muir 
(1878), Catalan (1884, 1885, 1886), Lucas (1888, 1891), Barisien 
(1906), Taupin and Retall (1907), Ursus and Grigorieff (1907), Can- 
dido (1907), and Bréié-Kostié (1952). 

In 1878, Glaisher expressed the above result in the following form: 


(2B) If nis odd then (X —Y)"+(Y —Z)"+(Z—X)” is divisible 
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by 


A(x —Y)P + (Y —Z)P + (Z— X)?). 
If n= -—1 (mod 6) then the above polynomial is also divisible by 


[(X —Y)? + (¥Y — Z)* + (Z—X)?]. 


1 
2 
Ifn=1 (mod 6), then it is divisible by 


S[(X —Y)*+(¥ -Z)*+ (Z-— X)4). 


ProoF. Let A,B be indeterminates and let n > 3 be odd. Then 
AB(A+ B) divides (A+ B)" — A" — B”. 

Let A= Z-Y, B= X—-Zs0 A+B=X-Y. Then AB(A+B) = 
(Z-Y)(X-—Z)(X -Y) = q(¥ —Z)° HZ —X)P HX Y¥)?] divides 
(xX —Y)"4+(Y —-Z)"+ (Z-—X)”. Next, 


A? +AB4+ B? = (Z-Y)?+(Z-Y)(X —Z)+(X -Z) 
= 4((Y -Z)?+(Z-X)?+(X-Y)’] 
and 
(A? + AB + B?)? aie 


)')- 


ay gon 
E(x = 


Hence by (2A) if n = —1 (mod 6) then $[(X — Y)? + (Y - Z)? 4 
(Z — X)] divides the given polynomial while if n = 1 (mod 6) then 


s((X —Y)4+(Y —Z)*+(Z—X)‘] divides the given polynomial. 


The following special result was explicitly given by Catalan (1884, 
1885), Gérono (1885), Nester (1907), Welsch (1909), and Brocard 
(1910): 


(2C) Let p be a prime number. 
(1) If(X+Y)?—X?-Y? = pXY(X+Y)P?, where P € Z[X,Y] 
then p=3, P=1, orp=7, P= X24 XY4+Y?. 
(2) If 2?-' —1 = pN? where N is an integer then p = 3, N = 
lorp=7,N=3. 
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PROOF. We establish the two statements simultaneously. First we 
observe that p #4 2. Taking X = Y = 1 we have 2? —2 = 2pN?, 
where N = P(1,1) € Z. 

If p= 3 then N = 1. We assume now p # 3. We have (2-1/2? + 
1)(2@-D/? 1) = pN?. The two factors in the left-hand side are odd, 
hence they are relatively prime. So, one of the factors is a square, 
while the other one is p times a square. 

But 2@-)/2 — 1 = 3 (mod 4) since (p — 1)/2 > 2, so 2@-D/? —1 
is not a square. Hence 2°-))/2 +1 = M?, M being an integer. Since 
2-1/2 — M? -1=(M—1)(M +1), it follows that M—1,M+1 
are powers of 2. But (M+1)-(M-1)=2,soM-1=2,M=3 
and necessarily p = 7, N = 3. Therefore 


(XPYV SRY" 
7XY(X+Y) 


P= 


= (X?+XY+Y")?. 


The polynomial C,(X,1) will be simply denoted by C,(X) and 
called the Cauchy polynomial for the prime p > 5. If p = 6k +1 then 
C,,(X) has degree 6(k — 1). We note the following special cases: 


C5(X) = 1, 

CX) = 1, 

Ci(X) = X°4+3X°4+7X4*49X34 7X7? 43X41 
= (X?+XY4+Y°*)P+[XY(X +Y)]?, 

Cig(X) = X°4+3X° 48X44 11X948X7 43X41 
= (X?4+XY4+Y’)?+2[XV(X+Y))?. 


These expressions in terms of XY(X +Y) and X?+XY+Y? will 
soon be generalized for arbitrary values of p. 

The Cauchy polynomials satisfy the following properties, where 
p = 6k +1 (see Mirimanoff (1903), Klésgen (1970)): 


(2D) 


C,(X) = C,(-1 — X). 
(2) C,(0) = C,(-1) = 1. 
l 


X) has no real roots. 
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k —1 disjoint sets, each composed of six distinct roots 


z =} 
1+2z’ z 


{5-42 
a 2), ’ 
Zz 1l+z 


PROoF. (1) Since 


(X41)? - XP -1 
One): pX(X +1)(X2+X+1)¢ 


(with e = 1 or 2) then C,(1/X) = X~?t7¢*8C,(X). From 


_ f 1 when p=6k—-1, 
© |} 2 when p=6k+1, 


it follows that C,(X) = X®*-DC,(1/X). Similarly C,(X) = C,(-1 
—X). 

(2) Since C,(X) is a monic and symmetric polynomial then 
C,(0) = 1. Also by (1), C,(—1) = C,(0) = 1. 

(3) Ifz>0 isa real root of C,(X) then (z+1)? = z? +1, which 
is impossible. If z < —1 is a real root of C,(X) then —(1+ z) > 0 
would be a positive real root of C,(—1— X) = C,(X), which is a 
contradiction. Similarly, if —1 < z < 0 is a real root of C,(X) then 
1/z < —1 would be a real root of C,(X), which is impossible. 

(4) Let z be any root of C,(X). By (1) it follows that 1/z 
and —(1-+ z) are roots of C,(X), hence —(1+1/z) = -—(z+1)/z, 
—1/(1+ z) and — z/(z+1) are also roots of C,(X). The sets of 
roots considered above are either equal or disjoint. Indeed, if 


1 z =| 
1l+27? 14+2’ Zz 


1 
tem, ={2 ,-(1+ 2), 
z 


it is easy to verify that M, = M,. 
Let z be an imaginary root of C,(X), and suppose it is a double 
root; this happens if and only if z is a root of C7). Since 


/ oe 1 ere el 
Ci(X) = 6(k — 1) X8*- YC, (=) a, aman OF (=) 


and C)(X) = —Ci(—1—X), it follows that 1/z and —1 — z are also 
double roots, hence each element of the set M, is also a double root. 
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From (X +1)? — X?-1 = pX(X+1)(X7+X+1)°C,(X), taking 
derivatives we have 


(X +1)P"h— XPh = (X(X 4 1)(X? +X 4-1)9)'C,(X) 
+ X(X +1)(X? +X 4+1)°C,'(X). 


Hence (z + 1)?-! = z®"!, so (1+1/z)’* =1, that is, 1+1/z isa 
(p — 1)th root of 1, so |1+1/z| =1. 

Beginning with the double root —z/(1 + z) (instead of z), we 
deduce that 1+ (—(1+z)/z) = —1/z is also a (p—1)th root of 1, and 
|1/z| = 1. Thus, the triangle with vertices 0,1,€ = —1/z has sides of 
length 1, so it is equilateral, therefore € is a primitive sixth root of 1. 
So €?-—€+1 =0, and €? = —1. Then —(z+1)/z = -1—-1/z = —-1+4+€ 
satisfies 


(-14+ 6) = -14 3€ — 367 + & = -14 3€ —-3€4+3-1=1, 


so its minimal polynomial is X¥? + X + 1. Since —(z+1)/z is also a 
root of C,(X) then X?+X-+1 divides C,(X), contradicting (2A). 


Mirimanoff also investigated in 1903 whether each polynomial 
C,,(X) is irreducible, and he conjectured this to be true. Klésgen 
(1970) has verified with a computer that if p < 31, the Cauchy 
polynomial C',(X) is an irreducible polynomial of Z[X]. Since the 
maximum of the coefficients of C,(X) grows very fast with p, the 
test was not continued any further. 

We report without proof the following results, due to Helou, 1997: 

If n > 3, n odd, then for every prime p, the polynomial C,, modulo 
p is reducible over the field with p elements. Moreover, if n is an odd 
prime and for some prime p, C,, modulo p has at most three factors, 
then C;, is irreducible. Helou attributes to Filaseta the proof that 
C2, is irreducible for every odd prime p; a proof is given by Helou in 
his paper (1997). More results about the Cauchy polynomials are in 
the paper by Terjanian (1989). 

We shall need another expression for the Cauchy polynomials. 
More generally, we shall express S,,(X,Y) = (X +Y)"+(-1)"(X"+ 
Y") as a polynomial in U = X?+ XY +Y? and V = XY(X+Y). 

There are two methods to achieve this result. 
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We may consider the polynomial of degree 3 in the indeterminate 
T, with coefficients in Z[X,Y], whose roots are X + Y,—X,-Y: 
F(T) = (T+ X)(T+Y)\(T-X-Y) 
= 7 + art” + agT + a3, 
with 
a,=X+Y-(X+Y) =0, 
dg = XY —X(X+Y)-Y(X+Y)=—-(X°4+ XY4+Y”), 
a3 = —XY(X +Y). 
Then S,(X,Y) = (X + Y)" + (-1)"(X" + Y”) is the sum of the 
nth powers of the roots of F(T’). We shall use the following classical 
expression, due to Waring (1782), for the sum 


Dy =A Pe Ea, (=e 12k.) 


of the roots of a polynomial 


k 
f is De hi +rt+ + Ap = [[c¢- ee 
t=1 

The special cases n = 1,2,3,4 were known to Girard (1629); see 
also Saalschiitz (1906). The proof may be found in Serret (1885), 
Lucas (1891), Perron (1951). A modern algebraic proof was given 
by Rédei (1952, 1959). Another proof, using power series, is given 
by Cesaro and Kowalewski (1904). 


LEMMA 2.1. With above notations: 


Pn = nT] D(a +++ Gx) Fas 


i=l 


where {(a,+---+@x)'}n ts the sum of all monomials (i!/(41!- ++ te!)) x 


a+» a® (with i, +++» +i, = 1,0 < iy,... ,%%) having weight equal 
to n, that is, 144 + 2ig +--- + ki, =n. 


The next result was proved for even exponents by Ferrers and 
Jackson (1852; see reference in Dickson’s History of the Theory of 
Numbers, Vol. I, p. 747); the proof for arbitrary exponent appears 
in Todhunter’s book (1861). It was proved again by Muir (1879), 
using the above lemma. Kapferer (1949) rediscovered it, with the 
same proof, when n is a prime. In 1969, Carlitz and Hunter gave a 
proof with the method of power series. 
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(2E) Letn>2, letU = X?+XY+Y", andletV = XY(X+Y). 
Then 
OEY P(x)" 4+ EY)” 


= n “a 3r—n+377n—2r—2 
nee 3 s(, 3 2) us 


(summation for max{0, (n—3)/3} <r < (n—2)/2, with the conven- 
tion that if n is even and r = (n —2)/2 then the coefficient of the 
summand is equal to 2). 


Proor. First PrRoor (Muir). By the above lemma 
(EAHA) a(S)" 


= (ae + 05)" 


n ; 4. 
Cal) a 12 _,i3 
=n ; aera 
y U pe ight ie 
2ig+3ig=n 
n 
(tg + 73 — 1) 
= 7; —1)! ie gis 
2 ) ae eA 2 a3 
2ig+3ig=n 
(ig +73 — 1)! 
es eae, 


(sum for all 2,23, such that 0 < ig, 7331 < tg +73; 2ig + 373 = n). But 
—da, = U, —a3 = V and 


(ig+ig-1)! 1 Gas 


inl ig! Ne teed 


(valid when i2 £ 0 or i3 £1). Let i2 +73 -1 =r, hence n—2—-—2r = 
(2t2 + 3%3) 2(i2 } 13 1) a 13. Therefore 19 = 3r —n+ 3 and 


(X +Y)"+ (-X)"+ (-Y)” 


at n r 38r—n+377n—-—2r—2 
=e (sa)? " : 


the sum being extended for max {0, (n — 3)/3} < r < (n—2)/2. 
Moreover, if n is even and r = (n —2)/2 then the corresponding 
summand has coefficient 2 (since i2 = n/2, i3 = 0). 
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SECOND PROOF (CARLITZ & HUNTER). This proof does not 
require Waring’s lemma. We consider the identity 


vA | x 2 Y 
1 Te KW ae 
(X+Y4+Z)-2XVY+VZ+ZX)W4+3XYZW? 


La(K4V SQW OY SYA xy 


Let Z = -(X + Y), so XY+YZ+4+ ZX = -U, XYZ = —V and 
changing signs: 


(2.3) 
(X + Y) xX Y — —-2UW+3VW? 
1+(X+Y)W 1-XW 1-YW $s 1-UW?+VW?’ 


The formal power series expansion of 1/(1 - UW? + VW?) is 
1 1 


1-UW+VWs 1—-W%U—VW) 


> w(U —VW)" 


r=0 


s Ww? oy (") ov" 


r=0 s=0 


ae = _4)\n — r 3r—ny7n—2r 
aps 1)"W ee V 


r=0 


(this last sum is actually for n/3 < r < n/2; by convention the other 
summands are zero). The left-hand side of (2.3) is equal to 


Sot gy! (ayn py. 


n=0 
The right-hand side of (2.3) is equal to 


Co 


—1 n+l n+1 2 Ts 3rtl—nyyn—2r 
Scyerwen yal, Te 


n=0 
= 1 n+2 wr 3 r erry ier 
Fe naar 
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ye vi aa |e oe Vas nyn 2r 


n=0 r=0 


21 n+l nt+1 r 3r+l—-ny7n—-—2r 
+2 aie A 3 Ceara V 
—2UW + 3VW? — 20°w? 


oe n+l r 
a S°(-1) 1)t1prt Dee ( [3srti-nyn—ar 
i= n—1-2r 


n—2r 


= —2UW + 3VW? - out 
ees Ss nti r 
=]. n n 3r+2—ny7n—-—1-—2r 
ips ip renal ee V ; 


age Leer 


because 


9 r 43 r — nt+2 r 
n — 2r n—-1-2r} n—2r\n—1-2r 


(for n > 3). On the other hand, the left-hand side of (2.3) is equal 


= Sey + xr + CYP 


Comparing the two sides of (2.3), as computed above, yields the 
statement. 


We note the following special cases. Taking an odd exponent, 
(2.4) (X 4 a eka = X2kr1 ~ ypekrl 


= s 2k +1 r [73r—2k+2 1 2k—2r—1 
2k —1—2r\2k —2—2r , 


2(k-1)<r<k-1 


If pis a prime, p ¥ 2,3, letting p = 6k +1, we have (see F. Lucas, 
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1897, and Kapferer, 1949): 


(2.5) (X+Y)? —X?-Y? = pXY(X+Y)(X°+XV+Y’) 
x oa e 2 a Ui @e-1/2-siy 2 
= 2i +1 
when p= 1 (mod 6), and 
(2.6) (X+Y)?—X?-Y? =pXY(X+Y)(X°4+ XY +Y’) 
x oa (%, . lan [7(P-8)/2-3i 772% 
= 2i 2i+1 
when p = —1 (mod 6). If the exponent is even, we have 


(2.7) (X+Y)?* + xX 4 7% 


= k L 3r—2k+3 7 72k—2r—2 
7 » k-1 -( 2 2)! . 


(2k—3)/3<r<k—-1 


In 1879, elaborating on previous papers of Glaisher (1878, 1879), 
Muir found the following recurrence relation and divisibility proper- 
ties of the polynomials S,,(X, Y): 


(2F) 
(1) VS,(X,¥) + USp41(X,Y) = Sn43(X,Y) forn>0. 
(2) 6S,(X,Y) = 3:92(X, Y)Sn-2(X, Y) Te 253(X,Y)Sn_3(X, a) 


forn> 3. 

(3) Ifn =0 (mod 6) then U,V )'S,(X,Y). 
Ifn=1 (mod 6) then U?V | S,,(X,Y). 
If n =2 (mod 6) then U | S,(X,Y). 
If n =3 (mod 6) then V | S,(X,Y). 
Ifn=4 (mod 6) then U? | S,(X,Y). 
If n =5 (mod 6) then UV | S,(X,Y). 


Proor. (1) Let n = 2m. By (2.4) and (2.7) and noting that 


2m r—1l x 2m+1 r—1l 
2m — 2r\2m—1— 2r 2m+1—-—2r\2m — 2r 


= 2m+3 r 
— Im+1— 2r\2m — 2r 
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it follows that 
V Som(X, Y) + USam4i(X, Y) = Som+3(X, VY): 
Taking n = 2m + 1 and proceeding similarly, we deduce that 


VSom41(X, Y) + USom42(X, VY) = Somys(X, Y) 


and this proves (1). 
(2) Noting that S.(X,Y) = 2U and $3(X,Y) = 3V, it follows 
from (1) that 
3S2(X, Y)Sn—2( X,Y) + 253(X, Y)S,-3(X, Y) 


(3) This follows immediately from (2.4) and (2.7). 


Using an extension of Waring’s formula, MacMahon indicated in 
1884 some more algebraic identities of the same family. 
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VIII 
Reformulations, 


Consequences, and Criteria 


In this chapter we give a variety of results, a good indication of the 
wide search for solutions of Fermat’s problem. There are reformula- 
tions into equivalent problems, also a number of consequences of the 
truth of Fermat’s theorem as well as statements which follow from 
the assumption that the theorem is false for some exponent. 


VIII.1. Reformulation and Consequences of Fermat’s Last The- 
orem 


In this section, we shall indicate some propositions which may be 
proved if we assume the truth of Fermat’s last theorem. Among 
these propositions, some imply, conversely, the truth of Fermat’s 
last theorem. 


A. Diophantine Equations Related to Fermat’s Equation. 


There have been many instances where certain diophantine equations 
were compared to Fermat’s equation. We describe in succession the 
results originated with Lebesgue, Christilles, Perrin, Hurwitz, and 
Kapferer. We also discuss briefly the equations of Frey which played 
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a central role in the recent approach and solution of Fermat’s prob- 
lem by Wiles. 


Al. Lebesgue 


The first result of this kind in the literature is due to Lebesgue in 
1840. It was proved again by Terquem in 1846 and rediscovered by 
Pocklington in 1913: 


(1A) Jf Fermat’s last theorem is true for the exponent n > 3 then 
the equation X?" + Y?" = Z? has only trivial solutions. 


PROOF. Suppose that x,y,z are nonzero positive integers such that 
x" +y?" = z?. It is easily seen that we may assume without loss of 
generality that x,y,z are pairwise relatively prime. Moreover, x, y 
cannot be both odd, otherwise z? = 2 (mod 4), which is not true. 
So, for example, x is even, y is odd, hence z is odd. 

Let + = 2°2’, with a > 1, 2’ odd, x’ > 0. Then (z+ y”)(z—y”) = 
2? — 2" = 7?" — 2?ang/?” We note that gcd(z+y", z — y”) = 2, so 


eye = oreo, 
Zz =e y” — Pee ae 


with r,s odd, positive, gcd(r, s) = 1. 
Adding and subtracting, we obtain 
z= pen Se alae cal 
- 


mr 


Say nh D0enN 8 gi ae (ph a Dane l gt h(a 4 an ten). 
Since ged(r” — 2971's”, r” + 2°"~ 1s”) = 1 then 
{ pe fee a, E> 0; 
pe geet a a >. 
Therefore 


ple + yu” —_ OD. iBe Ge —_ (2%s)”. 
By hypothesis, we must have u = 0, hence r” = 2°°~'s” is odd, 
so an = 1, which is a contradiction. 


As a corollary, Liouville proved in 1840 (see also Terquem, 1846): 


(1B) If Fermat’s last theorem is true for the exponent n > 3 then 
the equation X?” — Y?" = 2%” has only trivial solutions. 
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PROOF. Suppose that x,y,z are nonzero integers such that x2?” — 
y2” = 22". Lett = yy?" + 2", sot— 2" = y*" and also t+ 2” = x”. 
Hence t? — 22” = (ry)?”, so 27” + (ay)?” = t?. 

By (1A) we must have t = 0, and then necessarily z = 0, ry = 0, 
which is a contradiction. 


A2. Christilles 


The next result was proved by Christilles in 1967. We begin with 
a lemma: 


LEMMA 1.1. The equation X°+Y?+ Z? =3XYZ has a solution in 
nonzero integers x,y,z if and only ifat+ty+tz=0orr=y=z. 


PROOF. We have the identity 


(1.1) X°4+Y°4+ 2°-3xXYZ 
=(X+Y4+Z)(X°+Y°+2Z7-XY-YZ—- ZX). 


If x,y,z are nonzero integers then 2° + y? + z° = 3ayz if and only if 
etytz=lore’?+y4+227=acyt+ yet 22. 

However, for any integers x,y the equation Z? — (w+ y)Z + (a? + 
y” — cy) = 0 has solutions z = ((x + y) + (x — y) V—3) /2, which are 
integers exactly when x = y, and in this case x = y = z. 


(1C) Letn>3. The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent n. 
(2) The only solutions in nonzero integers x,y,z of X°° + YY?" + 
Le a BIOYS™ wreg = y= so 


Proor. (1) — (2) Assume that x,y,z are nonzero integers which 
are not all equal and such that 2°” + 43" + 23" = 3a"y"z". By the 
lemma we have 7” + y” + 2” =0. 

(2) — (1) Suppose that x,y,z are nonzero integers such that 
zc’ t+y" +2” =0; then x,y, z cannot be all equal and by the lemma, 
gn + ye” + yn = Bays”. 


Christilles has also indicated in the same paper the following suf- 
ficient condition for Fermat’s theorem: 
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(1D) Letn>3. If X?+Y"4+ 7" =5XY "Z(G" — XY") 
has no solution in nonzero integers, then Fermat’s last theorem is 
true for the exponent n. 


PROOF. Suppose that x, y, z are nonzero integers such that 7” +-y"+ 
z” =0. Then 


—2°" = (a +y")P = oP ty + 50%y” (29 $y") +1007 ya" +y"). 
So 
pe py? + 2°" = Baty” [e8” + yh” — Baty™2” + ary”). 
By (1.1) and the hypothesis 7°” + y?” + 23” = 3x"y"z", hence 


xr $y? + 2°" = Bary2”[2?" — oy” with x,y,z 4 0, proving the 
result. 


A3. Perrin 


Perrin showed in 1885 the following fact concerning the cubic Fer- 
mat equation: 


(1E) = The following statements are equivalent and true: 


(1) Fermat’s last theorem is true for the exponent 3. 
(2) For every n > 1 the equation 


(1.2) X?4+Y? 433-173 =2x% 3B°XVZ 
has no solution in nonzero integers x,y,z, not multiples of 3. 


PRroor. (1) — (2) Suppose that there exists an integer n > 1 and 
nonzero integers x,y,z, 3 / xyz, satisfying equation (1.2). Let 


Since x,y,z # 0 and 3 / xyz then a,b,c # 0. From (1.2) we have, 
taking the square and the cube 


ot yo 4 BoP? 781 993 yh 42 x 3391 73 2342 x 337-1823 = Ax 37% 9? yz? 
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and 
a9 + y9 £3°"-329 4 325y3 4 Ba3y% + 339523 
42 x 33% q3y323 4 33%y5 23 4 360-1736 4 36n-1,,3 56 

= 8x 3°" gz y3z3. 
Next we have 

a? =, ge (By _ aaa J gc + Bh lgtyz — 2°), 

b = wis a2 = ele Se + 3°* 1 gytz _ y*), 

fou = 328 (eg = a aT ea + Br ye: e_¢ Boney 
Adding these equalities, taking into account (1.2) and the preceding 
relations, leads in a straightforward manner to 

a+b? +c=0. 


(2) — (1) Suppose that there exist nonzero integers a,b,c such 
that a? +b? +c? = 0; we may assume a, b,c pairwise relatively prime. 
From the identity 


(X+YV+Z)j%=X°+V24+ Z774+3(X+V)\(Y + Z)(Z4+X) 
it follows that 
(a+b+c)>? =3(a+ b)(b+c)(c+a). 


But a+b, b+c, c+a are also pairwise relatively prime; for example, 
if a prime p divides a+b and also b+ c then since p divides a+ b+ c 
it would divide c and a, contrary to the hypothesis. 

Hence, one of the factors, say a+ b, is a multiple of 3, and neces- 
sarily there exist integers n > 1, x, y, z such that 


atb=39""12%, 
b+c=2°%, 


and 3"xryz =a+b+c. Hence 
e+ y? +3°""129 =%(atb+c) =2 x 3"zryz. 


Finally, x,y, z 4 0 (if, for example, z = 0 then a = —bsoa = —b= 
+1 which implies c = 0) and also 3 / xyz because a+b, b+c¢,c+a 
are pairwise relatively prime. 


240 ~=~VIII. Reformulations, Consequences, and Criteria 


A4. Hurwitz 


In 1908, Hurwitz considered the diophantine equation 
(1.3) X™Y"+Y™Z" + Z2"X" =0, 


where m > n > 0 and ged(m,n) = 1, without loss of generality. He 
proved: 


(1F) = The above equation has only the trivial solution if and only 
if Fermat’s theorem is true for the exponent m? — mn + n?. 


ProoF. Let x,y,z be nonzero integers such that 2”y” + y™z" + 
za" = 0. We may assume gcd(z, y, z) = 1. 

Let u = gcd(y, z), v = ged(z,x), w = ged(z,y). Then u,v, w are 
pairwise relatively prime. Hence vw divides x and we may write 
“= ©,vUwW, X, an integer. Similarly y = y,wu, z = z,uv with y, 21 
integers. Moreover, 21,41, 21 are pairwise relatively prime. Substi- 
tuting in the equation and dividing by u"v"w” we have 


eye Pw + yp ewe ul” + zPafu™ "v™ = 0. 
Thus u™—” divides z?yfu™ "w". But ged(u,v) = 1, ged(u,w) = 1 
and gcd(u,2) = 1, hence gced(u,2,) = 1. So w”~” divides y?. On 
the other hand, y? divides z?aju™-"v™. Since y; is relatively prime 


to 2%, 21, and v then y? divides u”~". Hence y = tu”. 

In the same way 27 = tw™”, zi = +vu™™. Since ged(m,n) = 1 
then gcd(m,m —n) = 1, hence 2) = uw", ww =u”, a =v” 
for some integers ui, U1, Wi such that u = tu}, v = tui, w = tuy. 


It follows that 


wi te pein An aint ab ul” yea te r)mayim TO) i 


a ee gg EOE = 


mn n mn n? mn 


Multiplying with u?~™"v? wi , and noting that m? — 
mn +n? is odd, we obtain the equation 


un ee + (tuyere + (tu, )™ mnt? =—_ 0. 


Conversely, if the nonzero integers u,v, w satisfy the relation 


yr mnt? + ym mnt? + wim wnt? Zs 0, 
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mn mn 


multiplying with wv" w™", we obtain 


yr te mnryymn 4 gym?-+n® mn mn 4 gym? +n? mn,.mn — 0. 


Vv uw wey 
Letting 
L=v"w"™, 
pa Ny,m 
y=wu, 
Zuo. 


we obtain xy” + yz" + 2x" = 0. 


As a special case, letting m = 3, n = 1, it follows that 
MY +Y°7+ ZY =0 
has only the trivial solution. 
A5. Kapferer 


In 1933, Kapferer published a proof that Fermat’s last theorem is 
true for the exponent n > 3 if and only if the equation 


(1.4) yaaa aaa ae ee ila, Gaile 


has no solution in nonzero pairwise relatively prime integers x, y, z. 
The proof of Kapferer contained a flaw, partly corrected by Riben- 
boim (communicated to Inkeri). Inkeri found and proved (correctly) 
the results which we give below. Comments by Gandhi and Stuff 
(1975) on this matter were inaccurate. 
We begin with the special case n = 3, which was given by Fueter 
(1930): 


(1G) The equation 
(1.5) Gea Vrss? xox 
has no solution in nonzero integers x,y,z with gcd(y, z) = 1. 


ProoF. Assume that 2z,y,z are nonzero integers such that 
gcd(y, z) = 1 and 
Saf SB Oe. 


Let 


v = 6723 — y, 
w = 6x2. 


| u= 6723 + y, 
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Then u,v,w € Z, w #0. We show that u4~0,v 40. Ifu=0 
then 622°? = —y, so y # +1. If p is a prime dividing y, then p | 62. 
So p | z, which is contrary to the hypothesis that gced(y,z) = 1. 
Similarly, v = 0. 

Finally, u? + v? = (622° + y)? + (6723 — y)? = 2 x 68a? + Bary? = 
6323 (2 x 632° + y?) = 63x3z3 = w. This is, however, impossible. 


It may be shown in the same way that there are no integers 
L,Y, 2,4 0 with ged(y, z) = 1 such that z3 — 3y? = 242°. 

We give now Inkeri’s equivalence, which is a corrected form of the 
one previously given by Kapferer (1933). It also involves the related 
equation 


(1.6) Violets) sie salam, Gals 


(1H) Letn>3 be an odd integer. The following statements are 
equivalent: 
(1) Fermat’s last theorem is true for the exponent n. 
(2) The equations (1.4) and (1.6) have no solution in nonzero 
integers x,y,z with gcd(y, z) = 1. 


PrROooF. (1) = (2) We assume that there exist nonzero integers 
“,y,z such that gcd(y,z) = 1 and z3 — y? = 3° x 22°-2g", Let 


a=y, 
b=3 x Qrlyr, 


C= 2%, 


so a? +3b? = c?. Then c is odd and ged(a, b) = 1 because ged(a,c) = 
ged(y, z) = 1. 

By Chapter I, Lemma 4.7, there exist integers r,s 40, gcd(r, 3s) = 
1, r#s (mod 2), such that 


a =r(r? — 9s), 
b = 3s(r? — s?), 
c=17r?+3s?. 


Then 
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Since x 4 0, then s, r—s, r+ are nonzero. Also s is even, because 
r #8 (mod 2). Since ged(r, s) = 1 then s, r— s, r+ are pairwise 
relatively prime. So there exist nonzero integers u,v,w such that 
2s=w",r—s=u",—-r—s=v". Sou®+v"+w" =0. We note 
that u,v,w #0 and gcd(u,v, w) = 1. 

Now we assume that there exist nonzero integers x, y, z such that 
gcd(y, z) = 1 and 23 — 3y? = 2?"-1y?". Let 


— 9n-1yn 
aS 20-5", 
b=y, 

C= 2%, 


so a? + 3b? = c®. Again c is odd and gcd(a, b) = 1. 
By Chapter I, Lemma 4.7, there exist nonzero integers 1, s, 
gcd(r,3s) = 1, r #8 (mod 2) such that 


a=r(r? — 9s"), 
b = 3s(r? — 8”), 
c=17r? +387. 


Then 
2°14" =a=r(r —3s)(r + 3s). 


We note that r is necessarily even, since r — 3s, r+ 3s are odd. So 
r, r — 3s, r+ 3s are pairwise relatively prime. Hence there exist 
nonzero integers u,v, w such that 2r = w", —r+3s =u", —r—3s = 
v” and so u” + vu" + w” = 0, with u,v, w 4 0, gced(u, v, w) = 1. 

(2) > (1) We assume that u,v, w are nonzero integers such that 
u” + vu" + w” = 0. Without loss of generality, we may assume that 
u, UV, w are pairwise relatively prime, w is even, and u,v are odd. Let 
s=w"/2andr=u"+s. Hence 2s = w", r—s=u", -r—s=v". 
Let 

a=r(r? — 9s"), 
b = 3s(r? — 8”), 
c=r* + 3s’. 


By substitution, we obtain a? + 3b? = c? and b = 3s(r? — 8?) = 
—3 x 2"-1 (Luvw)”. Let 

x= iLuv, 

y= a, 

Cae 
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so z3 — y? = 33 x 227-29?"_ Since gced(u,v) = 1 then ged(r,s) = 1 
and « # 0 by assumption. Now w is even, then s is even, so y is odd, 
hence y #4 0. Next, 3 / uwuw if and only if 3 | r. Indeed, if 3 /uow 
then 3 /'s(r? — s*), so necessarily 3 | r, and conversely, if 3 | r, from 
gcd(r,s) = 1 then 3 /s(r? — s*), so 3 /uvw. 

Now we show that gcd(x,y) = 1 which will be done in several 
steps: 

(a) Since s is even and gcd(r,s) = 1 then r is odd. Then ged(r? — 
s*, r? — 9s?) = 1. Indeed, if p | r? — s*, and p | r? — 9s? then p # 2 
and p | 8r?, p| 8s?, so p| gcd(r, s) = 1, which is impossible. 

(b) ged(r,r? — s*) = gced(s, r? — 9s?) = 1 because ged(r, s) = 1. 

(c) Then gced(a,b/3) = 1. Since x | b/3 and y = a then 
gcd(x,y) = 1. Since y is odd, we also have gcd(2z,y) = 1. Next 
we have: gcd(a,b) = 1 or 3. More precisely gcd(a,b) = 1 when 
3) r and gcd(a,b) = 3 when 3 | r. We note 3 | r if and only if 
3 | r(r? — 9s?) =a and this is equivalent to 3 /uvw. 

Also gcd(y,z) = gced(a,c) = 1 or 3 according to 3) r or 3 | r. 
Indeed, if p is a prime, e > 1 and p* | gcd(a, c) then from a?+3b? = c’, 
p* | 3b? so p| b, hence p = 3, gcd(a, b) = 3 and 3? /c, showing that if 
gcd(a,c) £ 1, then ged(a,c) = 3 and 3 | r. Conversely, if 3 | r then 
gcd(a, b) = 3, so 3? /c and by the above, gcd(a,c) = 3. 

If ged(y, 2); = 3 then. 9 | y. Let ay = 244 = 7/9, er = 2/3, 
then z? — 3y? = 227-472" with #,4,,2: 4 0, ged(yi, 21) = 1. This 
concludes the proof. 


For further use, we note that (with the above notations), gcd(2z, y) 
= 1. Here is a related result: 


(11) Letn>3. The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent n. 
(2) The equation (1.4) has no solution in nonzero integers x,y, z 
such that gcd(2x, y) = 1. 


PROOF. (1) = (2) Assume that x,y,z 40, gcd(2z,y) =1 and 2?- 
y? = 3° x 227-17?" Tf p is a prime and p | gcd(y, z) then p / 22, so 
p = 3, so 3? | y?, hence 9 | y. Let a, = 2, y = 9/9, m = 2/3, 
so 23 — 8y? = 27°12?" with 21, 41,21 # 0, gcd(y1, z1) = 1. By the 
implication (1) = (2) of (1H) then there exist u,v,w 4 0 such that 
uu” +v" +w" = 0. 
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(2) = (1) Assume that u,v,w 4 0, ged(u,v,w) = 1, and u”® + 
uv" + w" = 0. By the proof of the implication (2) = (1) of (1H) and 
the remark following this proof, there exist x,y,z 4 0, gcd(2z,y) = 
land 2) = 923" 6 2 


We examine in more detail the situation: 


(1J) Assume that (1.4) has no solution in nonzero relatively prime 
integers. If u,v,w #0, ged(u,v,w) =1, and u" + v" = w”, then: 
(1) n is odd, u=v = —w (mod 3), 3 /uvw. 
(2) If p is any prime dividing n, then p=1 (mod 6). 


Proor. (1) By (2) = (1) in (1H), there exist nonzero integers 
x,y,z such that 23 — y? = 3° x 2??~! x?” with ged(y, z) = 3 and also 
3 /uvw. 

By Chapter VI, (4L), n is odd, hence uw” + v" + (—w)" = 0. Let 
r=u",s=-—v",sor,s#0,r #5 (since w #0). We have r # 5s 
(mod 3), since 3 /w. Then u = u” =r #8 = —v" = —v (mod 3). 


Since 3) uv then u = v (mod 3). By symmetry, u = v = —w 
(mod 3). 

(2) Let p be a prime dividing n, n = pt, uw, = u', vy) = v*, w) = 
w', so ul + uf + (—w,)? = 0 with gcd(u1, v1, w1) = 1, wy = v1 = - 


(mod 3). By Chapter VI, (2A), we have p= 1 (mod 6). 


It was shown by Inkeri, using nonelementary methods (namely 
class field theory), that under the above circumstances, 3? = 3 
(mod p?); this congruence is very seldom satisfied by a prime p. 

It is appropriate to indicate now some facts about the equation 


(1.7) ZY =c. 


(For more details, see my book, Ribenboim, 1994.) Euler proved 
in 1738 that if c = +1 the only solution of (1.7) in integers greater 
than 1 is y = 3, z = 2. It was shown by Siegel in 1929, using deep 
analytical methods, that for each given c there exist at most finitely 
many solutions for equation (1.7). 

Using (1H) we may show: 


(1K) fea 3° x2? %a?’ where vm >. 3, ¢ = 1p p or pq (for 
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distinct primes p,q), then equation (1.4) has no solution in nonzero 
relatively prime integers y, Zz 


ProoF. Assume that y, z are nonzero relatively prime integers such 
that 23 —y? = 33 x 2?"-?2?"_ By the proof of implication (1) > (2) of 
(1H), we have 2"-!z" r(r?—s?) with r,s 4 0, r #8 (mod 2), ged(r, s) 
= 1. Then gced(r,r? — s?) = 1 and clearly r? — s? 41. 

If x = 1 then r = +1, r? — 8? = +2""!, so s? = 2%"! +1, which is 
impossible. 

If x = p then 2"~'p" = r(r? — s?). If p = 2, the above argument 
leads to a contradiction. If p # 2, then either r = 2"~', r? — 5? = 


p”, or vice versa. In the first case, r—s = +1,r+ 5s = +p” (or 
r—s=+p",r+s= +1); in all cases, we reach a contradiction like 
pea (p" = 1) or similar relations. 

If x = pq and ay or 2 is 2, we use the preceding argument. If Prd 
are odd then 2"~'p"q” = ae —s)(r+s). Ifr = +p" then r—s5 = 
+2" rts= Lg" ie +-q”) or vice versa, and 2p" = +(2"-' +q"), 
which is impossible. Similarly, r # +q”, sor = +2. !,r—s= 
+p", r+s = +q" (with appropriate sign) hence 2” = tp” +q". By 


Chapter VI, (3B), this is impossible. 


The paper of Yahya (1973), where he published a proof of Fermat’s 
last theorem, contained flaws of which one was the use of the incor- 
rect result of Kapferer. Yahya has also related Fermat’s equation to 
another diophantine equation. Inkeri examined this relationship and 
proved correctly the following statement: 


(1L) Letn>3. The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent n. 
(2) The equation 


(1.8) 227-27 x8 _Y?X —Z"=0 
has no solution in nonzero integers x,y,z such that gcd(z,y) = 
gcd(y,2z) = 1. 

A6. Frey 


We find in the thesis of Hellegouarch (1972) an elliptic curve asso- 
ciated to a hypothetical solution of Fermat’s equation with exponent 
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2p” (where n > 1 and p is a prime); the aim was to show the non- 
existence of points of certain orders in elliptic curves. 

In 1986, Frey had, independently, the same idea to associate to 
each solution of Fermat’s equation an elliptic curve. Explicitly, if a,b 
are nonzero relatively prime integers, n > 3 and a is even, the Frey 
curve is the curve with equation 


(1.9) Y? = X(X—a")(X +6"). 


It is an elliptic curve, whose properties were studied by Frey. Ribet 
and Wiles studied this curve under the assumption that there exist 
a nonzero integer c such that a” + 6" = c”. The use of advanced 
theories of elliptic curves, modular forms, Galois representations, 
led ultimately to a contradiction, thus showing that Fermat’s last 
theorem is true. We shall discuss this matter further in the Epilogue. 


B. Reformulations of Fermat’s Last Theorem. 


The following statements, which are equivalent to Fermat’s theo- 
rem, were given by Pérez-Cacho in 1946. The equivalence between 
(1), (2), (8), and (4) was first proved by Bendz in 1901 and was 
rediscovered by Krasner, who published his paper in 1939 (see also 
Rivoire (1968)), and by Chowla in 1978 (see also Inkeri (1984)). Lind 
indicated some partial results in 1909. 


(1M) Let m > 2,n = 2m—1. The following statements are 
equivalent: 


(1) The equation X"+Y" = Z” has only the trivial solutions in 
integers. 

(2) The equation X(1+ X) = T” has only the trivial solutions 
in Q. 

(3) The equation X* = 4Y" +1 has only the trivial solutions in 
Q. 

(4) The equation X* = Y"*! —4Y has only the trivial solutions 
in Q. 

(5) For every nonzero rational number a, the polynomial Z? — 
a”Z +a is irreducible over Q. 

(6) The equation (XY)™ = X +Y has only the trivial solutions 
in Q. 

(7) The equation X™ = X/Y + Y has only the trivial solutions 
in Q. 
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(8) If u,r are nonzero rational numbers, and if u,,U2,... 18 @ 
geometric progression of ratio r, then u2,-—u, +r #0. 

(9) If A is a triangle with vertices A, B,C, if the angle CAB = 
90°, if |AB| = 2, |AB| + |BC| is an nth power of a rational 
number then |AC| is not rational. 


Moreover, these conditions imply: 


(10) The tangents to the parabola Y* = 4X at every rational point 
distinct from the origin, cut the curve Y = X™ at irrational 
points. 


PRroor. (1) — (2) Let a,b,c,d be nonzero integers, such that 
b>0,d> 0, gced(a,b) = gced(c,d) = 1, and a/b(1+a/b) = (c/d)”. 
Then a(a + b)/b? = c"/d". Hence a(a + b) = c”, b? = d”". Since n is 
odd, then b = y” for some integer y. Since gcd(a,a + b) = 1, then 
a=x2",a+b= 2", for nonzero integers y,z. Thus 2” + y” = z”. 

(2) — (3) Let x,y be nonzero rational numbers such that 2? = 
4y” +1. Then « # +1 and ((# — 1)/2)((@ —1)/24+1) =y". 

(3) — (4). If w,y are nonzero rational numbers such that 2? = 
y"t! — dy, dividing by y"*1 = y?™, we deduce that 


2 n 
(=) =1+4 (=) | 
The 7] 

(4) — (5) The discriminant of Z? — a”Z +a is a"*! — 4a 4 0 
(since n is odd). By hypothesis, a”*! — 4a cannot be a square, hence 
Z* —a™Z +a has no root in Q. 

(5) — (6) Ifa,y are nonzero rational numbers such that (ay)™ = 
x+y, let ry =a. Then Z? —a™Z +a has a solution in Q. 

(6) + (7) Let x,y be nonzero rational numbers such that 2” = 
x/yt+y. Let t=2/ysot+y= (ty). 

(7) + (1) Let a,b,c be nonzero integers such that a” + b” = c”. 
Let x = c?/ab and y = cbh™"'/a™. Then 

a ca™* — cb™-1 — c(a® +b”) (S . “3 
= = =o. 
y bm a™ amb ab 

(3) — (8) Assume that u2, = u,—r. Since u, = uyr™* then 
the equation r2°"-)) Z? — Z +r has the rational root u;, therefore its 
discriminant is a square, that is, 1 — 4r?"~! = s?. Hence s and —r 
satisfy X* = 4Y" +1. 

(8) — (3) The proof of the converse is similar. 
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If x,y are nonzero rational numbers such that 4y" + 1 = 2x? then 
the equation y2°"-) Z? — Z — y = 0 has a rational root u,. Let 
us; = u1(—y)*}; then u2, = ury2-) = u, + (—-y). 

(5) < (9) The condition (5) is equivalent to the fact that a?” —4a 
is not a square in Q, that is, a?” — 4a” is not a square in Q, or still, 
(a™—-!Z)? = a?” — 4a” has no rational solution, for every rational 
number a 4 0. Adding 4, this is equivalent to the non-existence of 
rational solutions for 


4+ (a™~Z)7 = (a” — 2)’, 


which is in turn equivalent to (9). 

Now we show that: 

(7) — (10) Let (21, y1) 4 (0,0) be a rational point of the parabola 
Y? =4X. The tangent to the parabola at this point has equation 


YY = Q(X + £1), 


that is, 
y 
so 
2 
ene ares 
Y1 2 
The intersections of the tangent with the curve Y = X”™ are the 
points (x,y) such that y = (2/y,)a + y,/2, y= 2. Thus (2, y,/2) 
is a solution of the equation X” = X/Y + Y. By (7), x must be 
irrational. 


In 1958, Pérez-Cacho showed: 


(IN) Letn>2. There is a bijection between the following sets: 


(F) The set of solutions (x,y,z) of X"+Y" = Z", where x, y, z 
are nonzero pairwise relatively prime natural numbers; and 

(F’) the set of solutions (u,v, w,t) of U"+V?" = W"+T?" where 
u,v, w, t are nonzero natural numbers, gcd(u, v) = gcd(w,t) = 
gcd(v,t) = 1, andw=vecd(u,w), u=tged(u,w), t# v. 


ProoF. Let (z,y,z) € F, letu=22z,v=y,w = yz,t =x. Then 
v#t, since x £ y because 2x7” is not an nth power. Also u” +0?" = 
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creer ty = ar (arty) $y” = (ar ty”)y™ +207" = ye +07" = 
wr + t?”, 

We have u,v, w,t # 0, ged(u, v) = ged(w,t) = gced(v,t) = 1, z = 
gcd(u,w) and u = tgcd(u,w), w = vgecd(u,w). Thus (u,v, w,t) € 
FF’, 

Conversely, let (u,v,w,t) € F’. Let x = t, y=, z = ged(u,w). 
Then w = vz = yz, u = tz = xz and x7" — 7?" = 7 — y*"™ = 
u” — w" = (a” — y")z”. Since t £ v then x” + y” = 2”. 

Clearly the above correspondence between (F') and (F’) is a bijec- 
tion. 


In 1979, Vranceanu indicated a less interesting property equivalent 
to Fermat’s last theorem for the exponent n. 

We give now an equivalent combinatorial (!) formulation of Fer- 
mat’s last theorem. It appeared in a short note by Quine (1989). 

Consider a set of n > 3 balls, which are to be arranged into z bins 
which are white, red, or blue. Let: 


W = number of white bins; 
B= number of blue bins; and 
R= number of red bins. 


Then z=W+8B+R. Let: 


(r’b) = number of arrangements of the n balls into the bins, such 
that red bins receive no balls, but at least a blue bin has a 
ball. 

(rb’) = same, but with no balls in blue bins and some ball in some 
red bin. 

(rb) = same, with at least a ball in some red bin and at least a ball 
in a blue bin. 

(w) = same, with all balls in the white bins. 


We have: 


(10) Fermat’s last theorem is true for the exponent n if and only 


if (w) # (rb). 


PROOF. We have 


Als 


So 


= number of all arrangements of the n balls in the z bins. 


2” = (w) + (rb) + (rb’) + (rb). 
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Let « = R+W and y = B+W. Then the number of all arrangements 
of the n balls into the bins which are red or white is 


x” = (w) + (rv’). 


Similarly, the number of arrangements of the n balls which are blue 
or white is 


y = (w) + (r'B). 
If Fermat’s last theorem is true for n > 3 then 2” 4 x” + y” and, 


comparing, (rb) 4 (w). 
On the other hand, if Fermat’s last theorem is false for n > 3, let 
x,y,z be positive integers such that 7" + y” = 2”. Let W,B,R be 


given by 
B=2z-2, 
R=z-y, 


W=2x+y-%. 


By the above argument, (w) = (rb). 
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VIII.2. Criteria for Fermat’s Last Theorem 


In this section we gather various results of a different nature, all 
proved with elementary methods. For the convenience of the reader, 
we classify them into subsections: 


A. Connection with Euler’s totient function. 

B. Connection with the M6bius function. 

C. Proof that a nontrivial solution cannot be in arithmetical 
progression. 

D. Criterion with a Legendre symbol. 

E. Criterion with a discriminant. 

F. Connection with a cubic congruence. 

G. Criterion with a determinant. 

H. Connection with a binary quadratic form. 

I. The non-existence of algebraic identities yielding solutions of 
Fermat’s equation. 

J. Criterion with second-order linear recurrences. 

K. Perturbation of one exponent. 

L. Divisibility condition for Pythagorean triples. 


A. Connection with Euler’s Totient Function. The first result 
was proved by Pérez-Cacho in 1928, in a slightly weaker form: 


254 ~~ VIII. Reformulations, Consequences, and Criteria 


(2A) Let x,y be nonzero relatively prime integers, let n > 2 be 
an integer, and let p be any prime factor of n,n = pm. Let z be 
any integer such that z > 3, z divides x” ty”, but z does not divide 


m 


z™+y™. Then p divides y(z). 


ProoF. If p = 2, it is true since z > 3, so y(z) is even. 

Let p be an odd prime, n = pm, and suppose that p / p(z). Then 
there exist integers r,s such that rp — sy(z) = 1. Since y(z) is even, 
then r is odd. 

If we assume 2?” = —y?™ (mod z) but 2” # —y™ (mod z), then 
grpm = —y"P™ (mod z) hence x se@)+)m = —ylsel)+)™ (mod z). 
We have gcd(z,2) = gcd(z,y) = 1. Indeed, for example, if a prime 
q divides x and z, it divides x” + y", hence also y, contrary to 
the hypothesis. By Euler’s theorem, 2? = y?) = 1 (mod z), so 
x™ = —y"™ (mod z), which contradicts the hypothesis. 

The proof is similar when z | 2" — y", z| a" —y”™. 


In particular, if gcd(x,y) = 1 and p| n and taking z = a” +y", 
then p| y(a” + y”). As a matter of fact, as shown by Pérez-Cacho, 
this last assertion also holds when gcd(z,y) = d # 1. Indeed, let 
x = dr, y = dy, so gcd(mi,yi) = 1; then p | y(a7? + y7); but 
gz’ ty” = d"(xi + yf) hence p| y(a" + y”). 

As a corollary, we have the result proved by Swistak in 1969: 


(2B) [fp is an odd prime and0 <a < y < z are positive integers 
such that x? + y? = z? then p divides v(x), p(y), and y(z). 


PROOF. We may assume without loss of generality that x,y,z are 
pairwise relatively prime, because if, for example, x = dx, and p | 
(x1) then also p| v(x). 

We have 3 < z, z | a? + y? and z x + y; indeed, z? = x? + y? < 
(a+y)?,soz<a+y < 2z. By (2A), p| y(z). 

Similarly, x | z? — y’, « /z—y since z—y < x. Also 


x? = (z-y) (2 + pgP ry t+ (;) gP Sy? 4 ee v) 


> 2P7!p > 2P, 


so xz > 3. By (2A), p| y(z). 


VIII.2. Criteria for Fermat’s Last Theorem 255 


Finally, y | 2? —a?, y/z—a since z-—a2 < yand3<2<y, so 
again p | p(y). 


Bussi indicated in 1943 the following corollary: 


(2C) Ifx,y,z are pairwise relatively prime positive integers, if p 
is a prime, p | xyz, and x? + y? = z?, then there exist primes q,r, 8, 
such that q=r=s=1 (mod p) andq|z,r|y,s|z. 


PRooF. By (2B), p divides v(x), y(y), y(z). Since p) xyz then 
there exist primes q,r,s, such that p|q—1, p|r—1, p|s—1, and 
q\2,r|y, 8|z. 


This result is also a corollary of Chapter III, (1B). Another remark 
made by Bussi in 1932 is the following: 


(2D) Letn> 2, and let x,y, z be pairwise relatively prime positive 
integers such that x”+y” = z”. Ifk is any integer such that p(k) =n 
then gcd(k, xyz) > 1. 


ProoF. If gcd(k, yz) =1 then 2?) =y?) = 7°) =1 (mod k). But 
p(k) =n, sol =z" =2"+ y” =2 (mod &), which is absurd. 


B. Connection with the Mobius Function. Rameswar Rao 
proved in 1969: 


(2E) Ifn>3 is an odd integer, and if x,y,z are positive integers 
such that x” + y” = 2”, then the Mobius function pp vanishes for 
r+ y. 


PROOF. Since n is odd x+y divides x” +y” = z”. Any prime factor 
p of «+ y divides z” hence divides z. 

If w(x+y) £0 then x+y has no square factor, hence x+y divides 
z, in particular x+y < z. This is a contradiction since z < x+y. 


C. Proof that a Nontrivial Solution Cannot be in Arith- 
metical Progression. The next result is due to Bottari (1907); it 
was rediscovered by Goldziher (1913), by Mihaljinec (1952), and by 
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Rameswar Rao (1969). In 1908, Cattaneo gave the following simple 
proof, which is exactly the same as Rameswar Rao’s: 


(2F) Ifn>2andz,y, z are positive integers such that x"+y" = 2” 
then x,y,z cannot be in arithmetic progression. 


PROOF. Let us assume the contrary, so there exists a positive integer 
a such that x =y-—a,z=y+a. Then 


(2.1) (y—a)"+y" =(y+a)”. 


Dividing, if necessary, by the greatest common divisor of a,y, we 
may assume that gcd(a,y) = 1. We also see that y cannot be odd. 
The relation (2.1) implies that y” = am, for some integer m. Since 
gcd(a, y) = 1 then a= 1, so (y—1)" + y" = (y+ 1)”. 

We see that n cannot be odd, otherwise 


and since y is even then 2”~! would divide the odd number in brack- 
ets. 
Since n is even then dividing by y: 


rea[(er Gyno) 


soy"! = 21. Hence y/2 divides I. Since y/2 divides each summand 
in the bracket but the last one, it follows also that y/2 divides (,",) = 
n, soy < 2n. 

Hence y"~! > 2()y"-? > yt, a contradiction. 


D. Criterion with a Legendre Symbol. In 1958, Pérez-Cacho 
indicated the following criterion: 


(2G) _ Let p be an odd prime. Assume that if x,y, z are any nonzero 
relatively prime integers then there exists a prime q, q # DP, q dividing 


z? — pry and 
prt _§ 4 
(? ?) ey 
qd 


Then Fermat’s last theorem is true for the exponent p. 
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PROOF. We assume that there exist nonzero relatively prime integers 

x,y,z such that x? + y? = z?. By hypothesis, there exists a prime 

q, 4# p, such that q divides z* — pry and ((p?t! — 4p)/q) = —1. 
From 27? = x77? + 2x?y? + y’? it follows that 


ZRH!) _ A22gPyP — 22(?P — QoPyP + y??) = [2(a? — y”)]?. 
Let k be defined by z? = pry + kq. Then 


(p?t* — 4p)(axy)?** = (2? — kq)?t* — 4(ary)?(2? — ka) 
= 22+) _ 472 ¢PyP 


= [z(x? — y?)]? (mod q). 


Since p+ 1 is even, then p?*! — 4p is a square modulo q, which is a 
contradiction. 


E. Criterion with a Discriminant. In 1949, Kapferer gave a cri- 
terion involving the discriminant of a certain polynomial. In the 
proof of this result we shall require some facts about the resultant 
and the discriminant of binary forms, which have been gathered in 
Chapter IT, §4. 

The following lemma was explicitly used by Kapferer; a proof is 
given in his paper. 


LEMMA 2.1. Let F(X,Y), G(X,Y) be forms of degree n,m, respec- 
tively, let L(X,Y), M(X,Y) be forms of degree k, and let 


®(X,Y) = F(L(X,Y), M(X,Y)) 
(form of degree kn), 

T(x, Y) = G(L(X,Y), M(X,Y)) 
(form of degree km). Then 

R(®,T) = [R(F,G)|" [R(L, M)]””. 


ProoF. The result is trivial if n = 0 or m = 0, so we assume n > 
1,m->l. 

Let F(X,Y) = []Tj_, (a, X — aiY) where aj, a are not both zero 
(for each i = 1,...,n). Thus ®(X,Y) = F(L,M) = [[j_, (aL - 
a,M). Similarly, let G(X,Y) = IjLi(GX — BY), so T(X,Y) = 
11. (8.L — B,M). 
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Then by Chapter II, (4A), 


R(®,T) =] ]] ReiL — aM, BL — B;M). 
i=1 j=l 
For example, let 6; 4 0; then 
R(aiL = a;M, BL = 0;M) 
1 / / 
a ap R( Gia, L — B;0,M, B,L = B;M) 


8; 
1 


«BF 


’_ a. 3lyk 
= (Bras = 085)" ce, B,L — B;M) 


R ((Bye¥, - a48))L, BL — BM) 


| 
QB 
£ 

| 
& 
Pe 
“% 
B 
GS 


Hence 


Now we give Kapferer’s criterion: 


(2H) Let p be a prime number, p > 7. If there exist nonzero 
integers x,y,z such that 


p Vxyz(a — y)(y — z)(z— a) (27 + + 27) 


and x? + y? + z? = 0 (mod p”), then p divides the discriminant of 
the homogeneous polynomial 


(@—n/6 (PT? _ ny i, 
K,(X,Y)= 2 KPa yt 
(XY) d ( 2i \aa 
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when p= 1 (mod 6), 


(8) / (EP GN. = 
K,(X,Y) = 2 ce ea 
cass: ( , \aaa 


when p = —1 (mod 6). 
PROOF. Indeed, 
w’=x (modp), y*=y (modp), z2=z (mod p), 


hence x + y+ z= 0 (mod p) and so z? = —(a+y)? (mod p?). Thus 
(x + y)? — x? — y? =0 (mod p”). 

We note that y— z= 2+2y (mod p), z—x = —(2%+y) (mod p), 
a +y? +2? = (2? + zy + y’) (mod p), hence p / ry(x + y)(x — 
y)(Qax + y)(2y + x)(2* + zy +y?). 

We have seen in Chapter VII, (2.2), that 


(RPV PS XPS VP SOX (XE VCE KY EV OX.Y), 


where C,(X,Y) € Z[X,Y] is the homogenized Cauchy polynomial 
and e = 1 or 2, according to whether p = —1, or 1 (mod 6). Thus p 
divides C,(a, y). 

We show that p divides the discriminant of C,(X,Y). Indeed, let 
Q(X,Y) = XY(X + Y)(X?4+ XY + Y")®, so 


; [(X +)? — X? - ¥*] = Q(X,Y)C,(X,¥). 


Hence taking the partial derivatives: 


(x+y? xe = (Ky) SP(x,¥) + Bex, VOY), 
(X+Y)?*-Y? = Q(X, yyjee (X,Y) + 4 22 ix, VC ike 


OY oY 


Since p / xy(x + y) then x?~! = y?"! = (x + y)?"' = 1 (mod p); 
on the other hand, p / Q(z,y). Therefore, since p divides C,(x, y) 
then p | (0C,/0X)(a, y) and p | (OC,/0Y)(x,y). So, 


OC, OC, 


Discr(C,(X,Y)) = R (S (*%Y), OY 


—* (Xx, v)) =0 (mod p). 


260  ~=VIII. Reformulations, Consequences, and Criteria 


Now we use the expression of C,(X,Y) as a homogeneous poly- 
nomial in 
L(X,Y) = (X24 XY +23 = 03, 
M(X,Y) = X?Y?(X +Y)?=V?, 
which was derived in Chapter VII, (2.5) and (2.6): C,(X,Y) = 
K,(L,M) where 


(P—7)/6 (PT? _ Ny Lae 
K,(L,M) = 2 SL OOM 
when p= 1 (mod 6), 
(p-5)/6 (PT? _ nary 
K,(L, M) = 2 = LP *)/8—* yg 
when p = —1 (mod 6). We have 
dC, _ OK, OL | OK, OM 
OX ~~ OL OX OM OX’ 
OO, AK, OL. «AK, “OM 
OY OL OY OM OY’ 


where 

OL 2 2)2 

OM 

eS OXY AK LY Ox ay 

oN (X+Y)@X +), 

L 

~ = 3(X°+ XY +Y°)?(X +2Y), 

OM 

aS OR AV (XY V(X Pay). 

= (X+¥)(X 42¥) 
Letting L(az,y) =r, M(r,s) = s, then p /rs and from 

OC, _ OC, _ 

ax (OY) = Fy (@¥) =9 (mod p), 
it follows that either 


OK, OK, - 
Fpl 8) = Spbl(rs) =O. (mod p) 
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or 
OL OM 
det on i pe =(0 (mod p). 
OL OM 


ay my) oy (ey) 
Computing the determinant, this condition becomes 
6(x? + cy + y*)?(2Qe + y)(x + 2y)zy(x + y)(z@ —y) =0 (mod p). 


However, by hypothesis this determinant is not a multiple of p. 
Therefore 

OK, ~ OKs 
OL (r,s) — OM (r,s) 
that is, p divides the resultant of the binary forms (OK,,/0L)(X,Y), 
(0K,0M)(X,Y), having degree m — 1, where 


=0 (mod p), 


aad 

= 6 
= —5 
— when p=-—l1 (mod 6). 


when p=1 (mod 6), 


Since L(X,Y), M(X,Y) have degree 6, by Chapter V, (1A), 


aK, aK, 
R(SBOLY), SECGY)) 


= LR (Sem), ome, uy) Ree Y), M(X,Y))-, 


R(L(X,Y), M(X,Y)) = R((X24 XY +¥Y%)3, X°v2(X +Y)?) 
= [R(X?+ XY +Y?, XY(X+Y))]° =1. 


Hence p divides the resultant 


R (Se(.M), ome, M)) = Discr(K,(L, M)). 


We note that we have actually shown that if r = (x?+a2y+y’)* #0 
(mod p) and s = x*y?(a + y)? #0 (mod p) then (r,s) is a multiple 
root of the congruence K,(L,M) =0 (mod p). 

From (2G) we obtain the following criterion: 
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(21) Let p be a prime number. If there exist nonzero relatively 
prime integers x,y,z such that x? + y? + z? = 0 then either 


p | xyz(x — y)(y — z)(z—2)(x? + y? + 2”) 


or p divides the discriminant of the polynomial 


F(T) = F(T) =) ee ‘ 2i : Te 


4 


where m = (p — 3)/2 and the summation extends from i = 0 to 


—7 
— when p=1 (mod 6), 
— 

—5 
a when p=-—1 (mod 6). 

PROOF. Suppose that x,y,z are nonzero relatively prime integers 
such that p /xyz(a—y)(y—z)(z—2) (a? +y?4+27) and a? +y?+z? = 0, 
hence p > 7. 

Then x? + y? + 2? = 0 (mod p”), hence by (2G), p divides the 
discriminant of K,(X,Y), and more precisely, (r,s) with 


r=(a?+ay+y*)? £0 (mod p), 
s=xy(a+y)* £0 (mod p), 


is a multiple root of the congruence K,(X,Y) =0 (mod p). 
We write T = Y/X, so 


K,(1, ¥/X)=F(D), -K,(X,Y) = X"F(2), 


where 
at 
pe when p=1 (mod 6), 
7 when p=-—l1 (mod 6). 
But 
OK, 
= mX™ 7 F(T) — XX" CY F(T 
Re = mx" F(T) (n), 
OK, 


= xr), 
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and since (r,s) is a common root of the congruences 


OK, _ 
72 (r,s) = 0 (mod p), 
OK, _ 
oY (r,s) = 0 (mod p), 


letting t € Z be such that tr = s (mod p), then 


{ 0=mr™" F(t) —r™-78F'(t) (mod p), 
0=r™'F’(t) (mod p), 


so F(t) = F’(t) = 0 (mod p). This implies that p divides the dis- 
criminant of F(T). 


F. Connection with a Cubic Congruence. In 1944, Pierre used 
a method somewhat similar to the one of Kapferer, to reduce Fer- 
mat’s equation to two systems of congruences. 

To begin, we need a lemma (part of these assertions were proved 
by Mirimanoff in 1907 and Skolem in 1937; see also Skolem (1941)). 


LEMMA 2.2. Let p = 6k +1 be a prime number, let a,b € Z, p /b, 
and consider the congruence 


X*?+aX+b=0 (mod p). 


Let t € Z be such that tb? = a® (mod p), and 


tA, 6 OO D)OS 1) aes 
Uae ere 


Ip — (28 + D]lp — (2s + 3) --- Ip — (4s — 1) 
22(s-1) (2s — 1)! 


+(—1)*"!(2s — 1) [Rs 


(1) If —(4t + 27) ts not a quadratic residue modulo p, the con- 
gruence has a unique solution x,0 <a“ <p. 

(2) If 4t + 27 = 0 (mod p), the congruence has two distinct so- 
lutions %1,%2, 0 < %1,%2 < p. 

(3) If —(4t+ 27) is a quadratic residue modulo p and if V,(t) = 
0 (mod p) then the congruence has three distinct solutions 
%1,%2,X3, 0 < 41, %2,%3 <p. 
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(4) If —(4t + 27) is a quadratic residue modulo p, but V(t) #0 
(mod p), then the congruence has no solution. 

(5) There are k values of t such that the congruence has several 
distinct solutions and 2k values of t for which the congruence 
has no solutions. 


In this respect, Cailler (1908) indicated a relation between the 
resolution of congruences of third degree and linear recurrences of 
second order; see also Mirimanoff (1909). 

Let F(T) be the polynomial defined in (2H). With the above 
lemma, Pierre proved: 


(2J) Let x,y,z be relatively prime integers such that p / xyz(a — 
y)(y — z)(z—a)(a? + y? + 27) and a? + y? + 2? =0. Let r = (2? + 
sy+y’)’, s= 2° y(r+y). 
(1) If t satisfies the congruence rt = s (mod p®) then F,(t) =0 
(mod p?). 
(2) Ift satisfies the congruence r + ts =0 (mod p) then 


(——*) =1,  W,(t)=0 (mod >). 


PRooF. (1) By Chapter VI, (1C), 7 +y+z=0 (mod p?). Hence 
xP + y? = —2zP = (x + y)? (mod p*). Thus p* divides (x + y)? — 
x? — y? = pry(a + y) (2? + cy + y”)°C, (2, y) with e = 2 when p= 1 
(mod 6), e = 1 when p= —1 (mod 6). Since p /z then p x + y. 

Similarly, since p /x?+y?+2? then p /a?+ay+y?. Thus p® divides 
C,(x,y). With the previous notations, C,(2,y) = K,(r,s) =r” F,(t) 
(mod p*), with m = (p — 7)/6 when p = 1 (mod 6), m = (p—5)/6 
when p = —1 (mod 6). Since p /r then F,(t) = 0 (mod p’). 

(2) We observe: the congruence 


(oe? +ay+ yi +Tx*y*(@+y)? =0 (mod p) 
has a solution t if and only if 


0 (mod p), 


Pa? + tdxe +t 
0 (mod p), 


d?y? + tdy +t 


with some d, not multiple of p. 
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Indeed, if dx, dy are solutions of X?+tX +t =0 (mod p), then 
the other solution is congruent to —d(a + y) modulo p. Hence 


{ dlzy —x(x+y)—y(z+y)] =t (mod p), 
—@xy(x£+y) = —t (mod p). 


Therefore 
d?(x? + ry + y*) =—t (mod p), 
{ d’ry(z+y) =t (mod p), 
so (x? + xy + y’)? 4+ ta?y?(x + y)? =0 (mod p). 
Conversely, let d be such that dry(x + y) = —(xa? + xy + y’) 
(mod p). Consider the congruence X? + aX + b = 0 (mod p) with 
roots dx,dy, —d(a + y). Then 


{ ad [xy — x(x + y) — y(a + y)| =a (mod p), 
—Pxry(x + y) =—b (mod p) 
Thus 
{ d(x? + ay+y’) =—a (mod p), 
ary(x+y) =b (mod p) 


So b=a=t (mod p), hence 


d’x? + tdx +t =0 (mod p), 
dy? + tdy +t =0 (mod p). 


It follows from the above considerations that the congruence X?+ 
tX +t = 0 (mod p) has three incongruent solutions dz,dy, and 
—d(x+y). Indeed, if, for example, dx = —d(x + y) (mod p) then 
x = —ax(x + y) = z (mod p), contrary to the hypothesis. By the 
preceding lemma, —(4t + 27) is a quadratic residue modulo p and 
Wv,(t) =0 (mod p). 

This concludes the proof. 


It is worthwhile to recall that if p = —1 (mod 6) then p | 2? + 
xy +y’, hence p x? + y? + z? (see Chapter VI, Lemma 1.2). 

But Pollaczek has also shown in 1917 (see Chapter VI, (2J)) that 
even if p= 1 (mod 6), from x? + y? + 2? = 0, p / xyz, it follows that 
pla? +ayt+ y’, or equivalently, p /x? + y? + 27. 

Similarly, if ¢ = y (mod p) then x? = y? (mod p”), z = —2z 
(mod p), z? = —2?x? (mod p?), so 2x? = 2?x? (mod p?). Thus if 
p) x then 2? = 2 (mod p”). Actually, Inkeri showed in 1946 that 
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2? = 2 (mod p*) (see his paper). Hence, if p is such that 2? 4 2 
(mod p*) then necessarily p /(x — y)(y — z)(z— 2). 

Wieferich showed in 1909 that if the first case of Fermat’s last the- 
orem is assumed false for the exponent p then 2? = 2 (mod p”). This 
was the first of a series of criteria of similar type, discovered by Mi- 
rimanoff, Vandiver, Pollaczek, Rosser, and Granville. The proofs of 
these results are lengthy and nonelementary. For a fuller discussion, 
see my book 13 Lectures on Fermat’s Last Theorem. 


G. Criterion with a Determinant. In 1907, Bini used a classical 
recurrence relation to obtain an expression in terms of a determinant: 


(2K) Let x,y,z be any numbers, and let 


a=xr+yt+Z, 
b=aytyz+ 2a, 

C= xyz, 
and for everyn > 1 let S, = 2" +y" + 2”. Then: 
i} Sn = aSn—1 + bS,—2 + CSn—3 = 0. 


2 
0 -a b -e 0O 0 0 
0 1 -a 6b -e 0 0 
0 0 1 -a 0b 0 0 
0 0 0 1 -a 0 0 
Snadet] i 2: : 
0 0 0 0 0 b -c¢ 
3c 0 0 0 0 —a 0b 
—2b 0 0 0 0 1 -a 
a 0 0 0 0 1 


ProoF. (1) «2,y,z satisfy the equation 
X?— aX? +bX —c=0. 
Multiplying with X”"~*, we have 
X? —@X" 7 4 bX"? —exX™ 3 =0. 
Replacing X by x,y, z and adding the relations so obtained, we have 
Sn — ASn—1 + OSn—2 — CSn—3 = 0. 
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(2) We write the above relations for k = 1,2,...,n. Thus 
S1,59,...,5n Satisfy a system of n linear equations: 


Sn me aSy—-1 + bSin~2 = CSn—3 — 0, 


S4 = ass + bS>2 = cS = 0, 
S'3 = as + bS; = 3C, 
So = as} = —2b, 
Si = a. 
The determinant of the coefficients of the system is equal to 1. Ap- 


plying Cramer’s rule, it follows that S,, is given by the determinant 
indicated. 


It should be pointed out here that Bini proved that this implies: 
if p is an odd prime and x+-y+z=0 then ryz divides x? + y? + z?. 
But, as a matter of fact, this statement follows at once from the 
remark preceding (2A) of Chapter VII, noting that —z=ax+4 y. 


H. Connection with a Binary Quadratic Form. In 1963, Pig- 
nataro linked Fermat’s equation with the representation of a pth 
power by means of a binary quadratic form. 

We first recall the following well-known fact. Fermat considered 
in 1657 the equation X? — dY? = 1, where d is a positive integer, 
but not a square (this equation has been erroneously named after 
Pell). Fermat stated that he had proved by the method of descent 
the existence of infinitely many solutions in integers. However, the 
first published proof is due to Lagrange, around 1766. Explicitly the 
result is the following: 


LEMMA 2.3. Let p be a positive integer, but not a square. There 
exists a solution in positive integers (21,y1) for the equation X? — 
dY? = 1 such that: (x,y) is a solution in integers if and only if there 
exists an integer m such that x+yVd = (ay +yvd)™. In particular, 
for different values of m one obtains different solutions and therefore 
X? —dY? =1 has infinitely many solutions in positive integers. 


The proof of Lagrange used continued fractions. Another proof 
may be found, for example, in Ribenboim (1999). 

We introduce the following notation. If b,c are nonzero integers, 
let (b,c) = bX? + cY?. 
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(2L) Let p be an odd prime, and let x be an odd positive integer 
which is minimal such that there exist positive integers y, z satisfying 
xP + yP = zP. Then: 


(1) yz is a quadratic residue modulo x; and 
(2) x? is represented by the quadratic form (z,—y) in infinitely 
many ways. 


ProoF. (1) We first note that since x? + y? = z? and x is minimal 
then x,y,z are pairwise relatively prime. We write 


2 2 
a? = 2? —yP =z (ee) say (gee) 


so (z,—y) represents x?. Also, z (2@-/2)? =y (y@e-v/2)? (mod 2), 
hence yz (2-0/2)? = y?*! (mod x), so yz is a quadratic residue 
modulo x. 

(2) Now we show that y, z cannot be both squares. Otherwise 
g= yi 2H— 27 with 71,2; > 0y-s0 


xP = xP — yy? = (22 — yh) (a + yf). 


Since x is odd and gced(y, z) = 1 then ged (2? — y?, z? + y{) =1 and 
clearly 2? + yf > 1. Hence z? — y? is a pth power of an odd positive 
integer x’, 0 < 2’ < x, which contradicts the minimality of x. 

As seen above, x? = za? — yb? where a = z@-Y/?, b = y®-V/?, 
Consider the equation X? — yzY” = 1. Since gcd(y, z) = 1 and y, z 
are not both squares, then yz is not a square. 

By the above lemma, there exist positive integers u,,v, such that 
1 = uj — yzv; and moreover for every integer m if tm + Um/Y¥Z = 
(ur + v1,/yz)™ then 1 = u?, — yzv?,. We note that if 1 < m then 
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Um <Um+1 and Uy < Um4i. Then 


a? = (za? — yb’) (uz, — yzum) 


face 

eH) Gomme) om) (ere 
=z (atin + bUmy) = (Amz + bu) 2| 

(atin + bUmy) + (GUmz + bun) 4] 


= z(aUm + bumy)? — y(aumz + bum)?. 


This shows that x? is represented in infinitely many ways by (z, —y). 


I. The Non-Existence of Algebraic Identities Yielding So- 
lutions of Fermat’s Equation. In 1895, Jonquiéres investigated 
whether there would be algebraic relations connecting hypothetical 
solutions of Fermat’s equation. 

If n = 2, we have the algebraic relation 


(x ha (e Vee = (2XY)? 


and, as was indicated in Chapter I, (1A), this yields all primitive 
solutions of the Pythagorean equation. 
We shall see that nothing of the kind exists if n > 2. 


LEMMA 2.4. Let F = Fo + Fi + Fo+--- € Z[X,Y] where F; is the 
homogeneous part of degree i of F. 

Ifn > 1, there exist homogeneous polynomials of degree n, P2 © 
Z|Xo0,X1], P3 € Z[Xo, Xi, X2],... (depending only on n) such that 
the homogeneous parts of F” are: 


(F")o = (Fo)”, 


(F°) = n(Fy)" "Fh, 
(F")2 = n( Fo)" "Fe + Po(Fo, F,), 
(F")3 = n(Fo)""1 Fs + P3( Fo, Fi, Fe), 


(F"), = n(Fo)" “Fi, + Pe( Po, A,--- , Fe-1)- 


270 ~=VIII. Reformulations, Consequences, and Criteria 


PRooF. By raising F' to the nth power, we note that the homo- 
geneous part of (F"), consists of n(Fo)"~*F;,, plus a contribution 
involving the homogeneous parts of degree less than k. This is a 
polynomial expression, with multinomial coefficients, independent 
of F’, depending only on n. 


(2M) Ifn > 2, there do not exist polynomials F,G € Z[X,Y] 
such that X"Y" = F” —G". 


Proor. Let F= Fo+ fF, + Fot+::-,G=Go+G,+Ge+--- where 
F,,G;, are the homogeneous parts of degree k of FG, respectively. 
If PP = GS XY ™, then’ (F"), = (Ge tor k= Uae. 2a 
1, (Fon = (G")on = XPY By the lemma, Fo = Go, wa »Fon-1 = 
Gyn—1 and hence also 


n( Fo)" * Fon = (F" Jon = Poy (Fo, ba ney  Fon-1) 
(G”)on + xXx”"y” = Pon(Go, Gi, Aaret ,Gon-1) 
= n(Go)"" Gon + XY", 
) 


so n(Fo)"~! (Fan — Gon) = X"Y". This implies that n = 1, which is 
a contradiction. 


We recall from Chapter VI, (3C), that ifn >2,if0<4r<y<z 
are integers such that «” + y” = z”, then y is not a prime power. 
So y = ab, with a,b > 1, gcd(a,b) = 1. The above result tells that 
it is impossible to find polynomials F,G € Z[X,Y], such that for all 
a,b € Z ab” = |F(a, b)|” — [G(a, b)]”. 


J. Criterion with Second-Order Linear Recurrences. We re- 
phrase the results in Chapter III, (1A), (1B), (1C) in terms of second- 
order linear recurring sequences (see Kiss (1980)). We recall that if 
m > 1, if (Ry)g>0 is a sequence of integers, then r(m) is the smallest 
index r such that m divides R,. 


(2N) Let p be an odd prime, let x,y, z be relatively prime integers 
such that 2? +y°+z2? =0. Let A=ax-y, B= —xy, Ro = 0, R, = 1, 
and for every k > 2 let Ry = ARy_-1 — BRx-2, and D = A? —4B = 
(x+y)’. 
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(1) Ifp /z then R, =d?, where d is an integer, Rp = D@-V? = 
1 (mod p”) andd = 1 (mod p), r(d) = r(d?) =--- = r(d?) 
p (with the notation of the lemma). 

(2) If p | z then R,/p = d?, where d is an integer, R, = p 
(mod p*),D = 0 (mod p*?~?) and r(d) = r(d?) = ++. = 
r(d?) =p, r(d?*") # p. 


PROooF. (1) The roots of X* — AX + B are x,—y. By Chapter V, 
Lemma 2.1, Rp = (x? + y”)/(a+y) =@ (with d>1),c¢+y=C, 
as follows from Chapter III, (1A), where c= t, d= t, =1 (mod p), 
by Chapter III, (1B), (with the notation previously used). Then 
Ry, = d? = 1 (mod p*). Since p} z and + y+ z = 0 (mod p) 
then p/x+ysop/c. Hence D?-)/? = (4 + y)P-1 = PY =1 
(mod p?). 

Finally, since d? | R, then by Chapter V, Lemma 2.2, r(d),r(d?), 
--. ,r(d?) divide p; since d > 1 then r(d’) 4 1, so r(d) = r(d*) = 
a= Td) =p, 

(2) Assuming that p | z, it follows as before from Chapter III, 
(1B) and (2C), that R, = (a? + y”)/(a@+y) = pd’, whered=t, =1 
(mod p?), 7+ y = p"?-'c? where n > 2, c=t. Then R,/p = d? =1 
(mod p*) so R, = p (mod p*). Also, D = (x+y)? =0 (mod p*?~?). 

Finally, since d? | R, but d?*! / R, then r(d) = --- = r(d?) = 
p, r(d?*") # p. 


And now we give a result of Kiss and Phong (1979), containing 
an interpretation of the congruence g?~' = 1 (mod p”) in terms of 
an appropriate recurring sequence. 


(20) Let p,q be distinct primes such that p #4 2 and p/q-—1. 
Let A=q+1,B=q, Ro = 0, R; = 1 and every k > 2 let Ry = 
AR,_1 — BRy_2. Let D = A*? —4B = (q—1)?. Then the following 
statements are equivalent: 

(a) g?-" =1 (mod p’); 

(b) p?/Rp-1; and 

(c) r(p) = r(p’). 
PROOF. (a) — (b) The roots of X*-AX+B= X?-(q+1)X+q 


are a = q, 3 = 1. By Chapter V, Lemma 2.1(1), Rp-1 = (q-' — 
1)/(q—1). Since p /q—1 then p? | R,-1 if and only if p? | g?~' — 1. 
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(b) — (c) By Chapter V, Lemma 2.2(5), r(p) divides p—(D/p) = 
p—1, since D is a square. Let p—1 = sr(p). By Chapter V, Lemma 
2.2(8), we have Rp1 = Ror) = SRr~@ReGj41 (mod R?,,)). 

Since p| Ryp) then p? | R%,); also by Chapter V, Lemma 2.2(2), 
P| Ror), PY Rep 41 and p /s (since s < p—1). Hence p* | R,_, if 
and only if p? | Ry(p). Let up(Ry(p)) = k > 2. 

By Part (6) of Lemma 2.2 of Chapter V, r(p*) = r(p). But since p | 
R,(p2) then r(p) | r(p?). On the other hand, 2 < k, hence p? | Rp(pey = 
Rp), hence r(p”) | r(p), showing the equality. And conversely, if 


r(p) = r(p?) then p? | Ry). 


K. Perturbation of One Exponent. The following result (see 

Schaumberger (1973), Klamkin (1974)) is rather a curiosity; it tells 

that if one “perturbs” even slightly one of the exponents in Fermat’s 

equation, the new equation has infinitely many solutions in integers. 
We first show: 


(2P) If a,b,c are integers, a,b,c > 1, gcd(ab,c) = 1, then the 
equation X° + Y° = Z* has infinitely many solutions in integers. 


PrRooF. It is trivial if c= 1. Let c #1. We note that there exist 
integers d,e such that abd + 1 = ce. We have d ¥ 0, otherwise 
ce = 1,c > 1, hence c = 1, which has been excluded. So, there 
exists an integer t such that d+tc > 1 and ab(d+tc)+1 = c(e+abt) 
with e+ abt > 1 since c > 1. Thus, there is no loss of generality to 
assume that d > 1, e > 1, and abd+1 = ce. 

Let u > 1 be arbitrary and let 


r= gbdyjbe 
om ’ 
| y= gadyac 
= ’ 
b 
z= 2u™. 
Then gt +y? — gabd,abe 4 Qabdaabe = gabd+1y,abe _ JQeeq abe _ 2, Since 


u is arbitrary, the given equation has infinitely many solutions in 
integers. 


It follows at once: 


(2Q) Ifn,k > 1 then X°+Y" = Z"*1/" has infinitely many 
solutions in integers. 
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PROOF. First we note that (x,y,z) isa solution of X"+Y" = Z*r+1 
if and only if (x,y, z*) is a solution of X° +Y" = Z"+"/F_ Taking 
a=n,b=n,c=kn+1 in (20), we see that the given equation has 
indeed infinitely many solutions in integers. 


L. Divisibility Condition for Pythagorean Triples. In 1913, 
Niewiadomski considered the polynomials in three indeterminates 


Do= Dil A422) = =1, 
and if n > 1, 
Di DX Y 2) eR 
He observed the identity, for n > 1, 
(2.2) Dn4i —(X +Y)D, + XYDy_1 = Z" 1(Z— X)(Z-Y), 
which may be verified at once. 


With this identity, Niewiadomski and Métrod proved (1913): 


(2R) Let x,y,z be positive integers such that x? + y*? = z*. Let 
d, = D,(2,y,2) for alln > 1. Then 2d, is divisible by dj} when 
n> 2. 


PROOF. We may assume without loss of generality that x,y,z are 
relatively prime. Indeed, let e = ged(z, y, z), v7’ = x/e, y’ =y/e, 2 = 
z/e. So a” +y” = 2’ and we may assume that 2’ is even, while 
y’, 2’ are odd. Let di), = D,(2’,y’, 2’), so d, = e"di,, in particular 
d, = ed, and we note that d, 4 0. Hence 2d,,/d? = e"~?2d', /di’. 
Since n > 2, it suffices to show that 2d’, /d/,’ is an integer. 

With the assumption that e = 1, by Chapter I, (1A), there exist 
integers a,b with 0 < b <a, gced(a,b) = 1, such that 


x = 2ab, 
y= a? =. b, 
z=a’?+b?. 
If n = 2 then dz = 0 is divisible by d?. Similarly by (2.2), 


d3 = (1+ y)dz — xyd, + 2(z —2)(z —y). 
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Hence noting that d, = 2b(b — a), we have 


2d; = —4ab(a? — b?)d, + (a* + b*)(a — b)?4b° 
[2a(a +b) +a? + b’]d? = [(a + b)? + 207Idj. 


Assuming that 2d,_; and 2d,, are divisible by d?, it follows from 
(2.2) that 


Qdnii = (a+ y)2d, — ey - 2Zdy_y + 22z"""(z — 2)(z— y) 


is also divisible by d? because 


22"-1(z — 2)(z — y) = (a? + B?)" "(a — b)?40? = (a? + 07)" 12. 
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IX 
Interludes 9 and 10 


We shall need the Gaussian periods and the Lagrange resolvents and 
Jacobi cyclotomic functions in the study of Fermat’s congruence. 


IX.1. The Gaussian Periods 


Let q be an odd prime, p a primitive gth root of 1, h a primitive 
root modulo q, L = Q(p), B = Ziq], and let + be the generator of 
the Galois group of L | Q defined by 7(p) = p”. 

Every element a € DL may be indifferently written in a unique way 
as 


q-—2 q-2 : 
— a = hehe. 
a= ayp or as QS a; Pp 
i=0 j=0 


(with a;,a,;’ € Q); moreover, a € B if and only if each aj;, a,’ € Z. 
Comparing these two representations, and noting that 


Reni? ptt = =(P pass pt 2) 


it follows that Ao = —AQq-1)/2 and a; = a,’ — Aiq_1)/2 Where 1 = hi 
(mod q) (fori =1,...,q—2). 
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In the present situation, if g—1= fr, the r periods with f terms 
(relative to p and 7 or h) are: 
Ho = pt p+ ph pet ph, 
i, Sp ee ae 


Mpa = ph $e ph pl peep ph 


We have ey ju; = —l. For every j, we write uw; = pb; if 0 < 
jo < r—1 and j = jo (mod r). The periods py; are conjugate to 
each other: 7’(u;) = pi4; (for i = 0,1,...,q— 2, and any j). In 
particular, 7"(44;) = yj for 7 =0,1,...,r—1. 

Let L’ denote the subfield of L which is fixed by 7", so [L: L'] = 
f, (L': QU = f; the Galois group of L | L’ is generated by 7? and the 
Galois group of L’ | Q is generated by the restriction 7’ of r to L’. 
Let B’ denote the ring of integers of L’. 


(1A) 
(1) {to, Hi,--- ;r—i} ts a basis of the Z-module B’. 
(2) [= Q(Ho, pa »Hr—1); B= Z| H10, se 5s Ppea]. 
(3) {1,p, p?,-.., p+} is a basis of the B'-module B. 
(4) The polynomial of periods 
r—1 
(1.2) Fu,(X) = [[(X - os) 
i=0 


has coefficients in Z and it is irreducible. 


ProoF. (1) The elements pio, f1,... , 4p—1 are linearly independent 
over Z: if =} ay; = 0 (with a; € Z), replacing each u, by its 
expression, we have a linear combination of p,p”,... ,% which is 
equal to 0, and with coefficients 0. @o,a1,...,a,-1 € Z so each 
a; = 0. 

On the other hand, if a € B’ C B, we may write a = 74% a;p 
with a; € Z. Since r"(a@) = a then 


q-2 q-2 
hitr = hi 
) ap = ) ay,P 
1=0 1=0 
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and from the uniqueness of the expression, we deduce that 


a9 = Ap = + °° = A f-1)r5 
Q, = Gr41 = 00° = A f-1)r41; 
Qrp—1 Q2r-1 ere Qq-2- 


Hence a = S059 fly. 

(2) Clearly L’ D> Q(Mo,..- , Mp1) and B’ D Z[po,... , p—1]. The 
converse follows from (1.1). 

(3) Let G(X) = TI (X — p””) be the polynomial whose roots 
are the summands of the period fo. Then each coefficient of G(X) 
is invariant by 7’, hence it belongs to BN L’ = B’. 

Thus G(X) = X/+a,X/~!+---+a, and since p is a root of G(X), 
then pf = —(aipf-1+---+a,). So pf is a linear combination of 
1,p,...,p/—! with coefficients in B’. Multiplying the above relation 
successively by p,p?..., we deduce that p/t1, p/t?,... , p%! are also 
linear combinations of 1, p,p?,... , ef! with coefficients in B’. Thus 
every element of B = Z/[p] is a linear combination of 1,p,... with 
coefficients in B’. 

So {1, p,... ,p/~+} is a system of generators of the L’-vector space 
L. Since [L : L']| = f then {1,p,... ,p/~1} are linearly independent 
over L’, hence over B’. 

(4) The coefficients of F,,,(X) belong to Q, since they are invari- 
ant by 7; hence they are in B’NQ=Z. 

Since F;,,(4o) = 0, the minimal polynomial of fo divides F,,,,(X); 
its roots are all the conjugates of jo, so it must coincide with F,,,(X), 
which is therefore irreducible. 


It is not true in general that 


Z|Mo,+-- »fr—1] = Zo] = +++ = Z[uy—1]. 
For example, let q = 13, f = 3, r= 4, and k = 2. The periods are: 


We shall show that the unique expressions of j41, (2, Ug as poly- 
nomials in fo with rational coefficients require some non-integral 
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coefficients. Indeed: 


Ho = Ha + po, 
Hofla = fo + fla + fs, 
Hole = 3+ fi + Hs, 


and 


Mo = Hob + 2Mop2 = 6 + po + 3p + 3ps 
6 + Ho + 3(-1 — po — pe), 


hence 
fiz = 3(—Ho — 20 + 3). 

From this we obtain 

fi = Ho — 2p = 5 (2M + 35 + 440 — 6), 

big = —1— fo — pl — Me = $(—HG — 315 — Spo). 
It follows from (1A) that given i,j7,0 < i,j < r—1, there exist 
integers n;;, € Z(0 < k < r—1), which are unique such that pip; = 
ho Nijklle. More precisely: 


(1B) We have the relations 


r-1 


So Miltite = req — f (for 0k Sr 1), 


i=0 
where 


1 when f is odd andk =0 or r/2, 


1 when f is even andk =0, 
hy = 
O otherwise. 


PROOF. First we evaluate the product 


it lr f=" k+jr 
Hobe = (se ) (Si : 
1=0 j=0 


Writing 7 =i+1 (mod q—1) then the above product is equal to 
f-lf-1 


ihe S- ‘: peta) 


1=0 i=0 
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Let 
f= 
a hn’? (14+hktir) 
=e 
i=0 
If 1+ h***" 40 (mod q), there exists a unique t, 0 < t < q—2, such 
that 1+ h*t'" = h' (mod q); hence p/, is equal to the period p;. If 
1+ h**’" = 0 (mod gq) then pi, = f. Therefore, we may write 


(1.3) Hobe = Nef + MgoM@o + Mei fa +++ + Me r—1bn-1; 


with integers nz, > 0, Meo > 0,...,™Mx,r-1 = O. Now we determine 
Nk. 

(I) If f is even and k = 0, let f = 2f’, then 1+ Af" = 0 
(mod q) since fr = q—1. So py). = f. On the other hand, if 
0<i< f and yp,’ = f, then we have 1+ h'’” = 0 (mod gq), hence 
2ir =0 (mod q—1), that is, 2ir = mrf; but mf = 21 < 2f, m=0 
or 1. If m = O then 7 = 0, an absurdity because q is odd. Thus 
m=1,i= f/2. Therefore in this case n;, = 1. 

(II) If f is odd (hence r is even) and k = 1/2, let i= (f — 1)/2. 
Then 1+ h7/?(S-D/2)" = 0 (mod q), 80 p(_1)/2 = f. On the other 
hand, if 0 <i < f and yp,’ = f, we have 1+ h’/?+"" = 0 (mod q), 
hence r + 2ir = mrf; thus mf = 1+ 21: <1+2f; it follows that m 
is odd, som = 1 and i = (f — 1)/2. 

(III) We consider the remaining cases. If 1 + h**'’" = 0 (mod q) 
then 2k + 2ir = mrf and 0 < r(mf — 27) = 2k < 2r, thus mf —2i = 
0 or 1. 

If mf = 27 < 2f then m = Oorl, and &k = 0. If m = 0 then 
i =0, k = 0, an absurdity, since g is odd. Thus m = 1 and f is even, 
which is a case already studied. 

If mf = 204+1 < 2f +1 then m is odd, m < 2, som = 1, f 
is odd, i = (f — 1)/2 and also k = r/2, which was Case (II) above. 
Therefore, in Case (III), n, = 0. 

Since pop, is the sum of f? terms of the form p’ and since each 
period contains f such terms, all appearing with different exponents 
1,0<i7<q-1, it follows that 


Me + Meo + Mer tes t+ Meni = f. 
Applying the automorphisms 7‘, we obtain from (1.3): 


Mille+i = Nef + Me0fi + Me rpigi +--+ + Meg n—1fbr—1- 
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Hence, from S~7—) “4; = —1 we conclude that S772) pitise = re(q — 
1) — (Mp0 + M41 +++ + Mgr) = MeG — Ff. 


IX.2. Lagrange Resolvents and Jacobi Cyclotomic Function 


We shall use the following notations: 


e p,q are prime numbers such that q — 1 = 2kp; 
e ¢ = primitive Ath root of 1; 

e g = primitive root modulo p; 
e 
e 


K = Q(q); 
A=Z(¢); 

e o = generator of the Galois group of K | Q, defined by a(¢) = 
a 


e p= primitive qth root of 1; 
e L= Qo), B=Z[p); 
e 7 = generator of the Galois group of L | Q, defined by r(p) = 


h 
Ps 

}l0,+++ y{p—1: the p periods with 2k terms (relative to p, 7); 
L' = Q(Ho, « «+ 5 Mp1) = Q(uo) = ++» = Q(Mp-1); 


B’ = Z|po,.- + , fp—a]; and 
Tv’ = restriction of 7 to L’. 


Q(p, ¢) 
Q(p) = ie | 
eee L'(¢) 
- ee QQy=K 


We note that DM K = Q. Indeed, the prime q is totally ramified 
in LZ and unramified in K, hence it is both totally ramified and 
unramified in LA K,soLnN kK =Q. 

Thus Q(p,¢) is a Galois extension of kK with Galois group iso- 
morphic to the one of L | Q and generated by the automorphism 7, 
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defined by 


=¢, 


Similarly, Q(p,¢) is a Galois extension of L with Galois group 
isomorphic to the one of K | Q and generated by the automorphism 
o, defined by 


{HIE 


LeOoe 


It is convenient to work with indices, as we define now. 

If t is any integer, not a multiple of g, then there exists a unique 
integer s,0 < s < q— 2, such that t = h® (mod q). s is called 
the index of t (with respect to h,q), and we write s = ind;(t), or 
simply s = ind(t) if there is no ambiguity concerning the choice of 
the primitive root h. 

For example, ind(1) = 0, ind(—1) = (¢—1)/2. If t =t’ (mod q) 
then ind(t) = ind(t’) and if t,?t’ are not multiples of g then ind(tt’) = 
ind(t) + ind(t’) (mod q—1). It is also clear that every integer s, 0 < 
s <q - 2, is an index, namely s = ind(h*). 

We define the Lagrange resolvent (a, 3),, wherea € K, 6 € L, 
and 7 is the automorphism defined above: 


(2.1) (a,8), = 6+ ar(6) + a°77(8) +++» +a? 77? (8). 


We write more simply (a, 3) in place of (a, 3),. The element (a, (3) 
belongs to the field Q(¢, p). We note at once: 


(2A) For every n and B € Q(p): 


CF ((C", BY) = (CP, 8) and —(6", 8)? € QU). 


PROOF. 


CMOS BY) SOTO Cap) ca (B) Bese Ce re 8) 
= (20): 


since ¢? = 1. So 7 ((¢”, B)?) = (7(¢", B))? = CP -(C™, B)P = (C", B)?. 
Since (¢”, 3) is invariant by 7 then (¢”, 3)? € Q(¢). 


284 IX. Interludes 9 and 10 


We consider the resolvents (¢”, p”). With the index notation in- 
troduced above, we may write 


q-2 q-l 
(2.2) (os Pog = S- Cage = S- Cena) Be 
u=0 t=1 


The complex conjugate of (¢",p™), is (C",p™), = (C",p ™)r. A 
first result to record is the following: 


(2B) With the above notations, for every n=1,2,...,p—1: 


(oP = (¢", Mo) 7 = 0 


and it belongs to L’. 
PROOF. 


(22) NC pe Sep Cop ep sae ee 
=pt Co feet cele e 
ff pr. ze a scrips Calne 


p2P-1 


p(akt+Dp 
Tp 


ae ye pA se C(P-1)m ght 
= fot Crt + CP Uy 4 
(¢", Mo) +! € Ge 


Moreover, (¢”, to), # 0. Indeed, the periods jug, M1,..- , Mp—1, 
which are a basis of L’ | Q, are still a basis of L/(¢) | Q since this 
extension has still degree p. 


The theory of Gaussian periods, Lagrange resolvents and more 
general sums of roots of unity is very rich and important. We shall 
only indicate the results which will be needed in the sequel. 


(2C) Ifp\n, qm then (¢",p™) = (¢", p)g-"4™),_ In particular, 
(Cp Scope: 
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Proor. (¢”,p™) = S277 (7nd p™, But ind(tm) = ind(t)+ind(m) 
(mod q — 1), hence 


q-1 
Cea = Ce) S- Cuan 


t=1 


q-1 
= Cee S- Coats) ae 


s=1 


SC Ca): 


(2D) [fp yn then (¢",p)(C-”, p) = 


PROOF. 
q-1 q-1 ; 
( CG ind(t ) (S Ce 7 
t=1 s=1 


q—-l1q-1 
cane t)- and ts) Artes 


(ORD NG ep) 


s=1 t=1 


For each s let r be defined by the congruence t = rs (mod q). Since 
p|q-—1 then ¢4~' = 1, so the above sum is equal to 


—1lq-1 


Q 


Sy Crime) pete, 
s=lr=1 
But q—1 = 2kp so 
q-1 r P 
Ss Ce (r) _ as ad ok S- ¢ 
r=1 m=1 m=1 
hence we may add $747; ¢”™4) — 0 and write 
q-l1q-1 
n —n nind(n) .(r 
(CPG i a ee 
s=0 r=1 
q-1 q- 
_~ Coma) > a) ; 
r=1 s=0 


But 


3 (r4ijs__ f @ whenr=q—l, 
ar ~ | 0 when l<r<q-2, 
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hence (C", p}{C-", p) = qorindla-2) = gorind-Y) = gene? 


(2E) = |(¢",p)| = when p fm. 
PrOooF. The absolute value of Am = (¢’", p) is 
[Am| = AmAm = (6, pC p') 
= (6, p)(O-™, pygrmind2) = gore? 
= on"? =q, 
using (2D), (2C). 


= gon 


Now we give an expression of the Gaussian periods in terms of the 


Lagrange resolvents: 


(2F) Ifp/n then 


L116 


a amg (C7, 9 
PROOF. We compute the right-hand side: 


Seca) = Soom (Scher) 


j=0 j=0 u=0 
—1 q—2 
= yy ie n) oh 
j=0 u=0 
—1 q—2-n 
yr 
j=0 t=—-n 
q-2-n p-l 
See. Ee 
t=—-n j=0 


But ae ¢/* = p when p | t, and equal to 0 otherwise. Thus the 


above sum is equal to 


q-2-n 
ttn 
o( > p" ) =m 
i=—n, pit 


x 
The Local and Modular 


Fermat Problem 


In this chapter we investigate some natural modifications of the origi- 
nal Fermat problem. In the first section, we search solutions in g-adic 
integers. In the following section we consider Fermat’s congruence. 


X.1. The Local Fermat Problem 


Our aim is to show that for every prime q, Fermat’s equation has so- 
lutions in nonzero q-adic integers. Our tool will be Hensel’s Lemma. 


(1A) For every prime q and every prime p, the equation X?+Y” = 
Z” has nontrivial solution in q-adic integers. 


ProorF. If p = 2, this is trivial, by Chapter I, (1A). So we may 
assume p # 2 and for convenience, we consider the equation X? + 
YP+ Z?=0. 


First Case: q # p. 

Let F(X) = X? + q? —1; then XP + q?-1= XP-1=(X 
1)(XP-1+ XP-274.--+X +1) (mod gq). Since 1 mod q is not a root 
of XP-1 + XP-2 4... + X +1 modulo q, by Hensel’s Lemma (see 
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Chapter V, (1T) and (1U)), there exists a g-adic integer a such that 
a =1 (mod gq) and a4 + p?+ (-1)? = 


Second Case: q = p. 

Let F(X) = X? + p? —1 and Go(X) = X — 1, Ho(X) = XP + 
XP-?4..-+X+1. By Chapter II, (4B), the resultant of Go and Hp is 
equal to R = Ho(1) = p, so vp(R) = 1. Since Go(X)Hp(X) = X?-1 
then F(X) = Go(X)Ho(X) (mod p’). But p > 3 > 2u,(R), so we 
may apply Hensel’s Lemma of Chapter V, (1T) and (1U). Thus there 
exist monic polynomials G(X), H(X) in Z,[X] such that G(X) = 
Go(X) (mod p’-'), H(X) = Ho(X) (mod p?-') and F(X) = G(X) 
xH(X). So G(X) = X —a where a is a p-adic integer, a = 1 
(mod p?~'), soa #0 and F(a) = 0, that is, a? +p? +(—1)? = 


In the proof we obtained solutions in which one of the numbers 
was not a unit in the q-adic field. So it is natural to ask if there is 
always a solution in q-adic units. In the sequel we give results which 
may be found in Kl6ésgen’s paper (1970). 


(1B) Letn>1 and let p be an odd prime. The following conditions 
are equivalent: 
(a) There exist integers x,y,z, not multiples of p, such that x?” + 
ye + 2?" =0 (mod p**?). 
(b) For every m > 0 there exist integers &m,Ym,Zm, not multi- 
ples of p, such that x”, + y?” + z?° =0 (mod p"*!*™) and 
Tm41 = Im (mod ie | Ymt+1 = Ym (mod pe) 
(mod p™*"), 


; em4+1 = zm 


PRooF. It suffices to prove that (a) implies (b) and we proceed by 
induction on m. From 2?” + y?” + 2" = 0 (mod p"*'™), with 
integers 2m,Ym;2%m not multiples of p, we may write x?” + yP + 
ze" = p'prttt+m with r’ € Z. Since p |} zm, there exists an integer 
r such that rz?,-! = r' (mod p). Let tmi1 = Lm, Ym+1 = Ym, and 
Zme1 = 2m — Tp. Then 


2 


x @ Md oa ona <a 


n 
Pp =. p™ p’—-1 n+ltm p p”—2,,2, 2(1+m) 
em41 = 2m — &m rp + ( Je rp 
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n+l+m ere) 
5) 


— p” os p’-1 
= 22 — 2h ~'rp (mod p 


because m > 0, p 2, so p”*?*™ divides all summands but the first 
two. Hence 


n n n+1 n n n n 
p p Pp — _P p Pp p"-1,,,.nt1+m 
em+1 =F Ym+1 oh em+1 = Un aa Ym + zm — ®m rp 
ar =2i. pre 
= 0(ned p""7"""). 


We note that we may in fact take 7, = 20, Ym = Yo for every 
m > 0. As a complement, we note the analogous result for p = 2: 


(1C) Letn>1. The following conditions are equivalent: 
(a) There exist odd integers x,y,z such that 


a ty +27" =0 (mod 2”*”), 


(b) For every integer m > 1 there exist odd integers 2m, Ym, 2m 
such that 


4a ee = 0 mad BFF) 


and Lm41 = Lm (mod 2™*1), Ymir = Ym (mod 2"), zat 
=z, (mod 2”*1). 


PROOF. The proof is quite similar. It suffices to note that if z+, = 
Zm — 72™*!, then 


7 n n 2 ne 
2 en ND, 2?-—1 nt+tl+m 2” 2, 252(1+m) 
emit = em tem Te +(3 Zo Ped 


= (:)) go OY) fh Renee 


Di rane 


2n Qe 71 n+lt+m 
lamas (mod 


= Re ee 


because 2”~!+20+™) divides all but the first two summands and n+ 
2+m<n—1+2(1+™m) since m > 1. 


Concerning the solutions of Fermat’s equation by p-adic units, we 
have: 
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(1D) Let p be an odd prime. The following conditions are equiv- 
alent: 


(a) There exist units a, 3,7 of Lp such that a? + BP +7? =0. 

(b) There exist integers xo, Yo, 20, not multiples of p, such that 
x + yp + 25 =0 (mod p’). 

(c) For every n > 0 there exist integers Ln, Yn; Zn, not multiples 
of p, such that x? + y? + z? =0 (mod p"*?) and fn41 = Ln 
(mod ae Yn+1 = Yn (mod Gers 2n41 = 2n (mod pera) 


PROOF. (a) > (b) We write a = xo +a0'p, 8 = yot B'p, y = 20+ 
where 29, Yo, Zo are integers, 0 < x9, Yo, 20 < p—1, anda’, f’,7' € Das 
Since a, (,y are units, then p / xoyo2. From a? + BP + 7? = 0 it 
follows that 2} + y} + 26 =0 (mod p”). 

(b) = (c) This was proved in (1B). 

(c) = (a) The sequences of integers (%n)n>0; (Yn)n>0; (Zn)n>0 are 
p-adically convergent, since %4, = Xp, (mod p”*!), ynit = Yn 
(mod p”*?), 2n41 = Zn (mod p”*') for every n > 0. Let a= limz,, 
6 =lim yn, y = limz,. Since x? + y? + z? = 0 (mod p"*?) then, at 
the limit, a? + 6G? +7? =0. 


By the above proof, the conditions of (1D) are equivalent to: 


(a’) There exist integers x,y, not multiples of p, and a unit y € 
Zp, such that x? + y? + y? = 0. 


Similarly, we have: 


(1E) Let q,p be distinct primes. The following conditions are 
equivalent: 


(a) There exist units a, 3,7 of La such that a? + BP +7? =0. 

(b) There exist integers xo, Yo, 20, not multiples of q, such that 
x + yp + 26 =0 (mod q). 

(c) For every n > 0 there exist integers Ln, Yn; Zn, not multiples 
of q, such that x? + y? + z? =0 (mod q"*"), and an41 = Zn 
(mod rie Y Yn+1 = Yn (mod Pas Zn+1 = 2n (mod gery 


PROOF. (a) > (b) We write a = x9+a0'q, 2 = yot O'G, Y = 20+7'9; 
where 2, Yo, 20 are integers, 0 < 29, Yo, 29 < q—1, anda’, 3’, € La: 
Since a, 3,7 are units then q / xoyozo. From a?+3?+~7? = 0 it follows 
that x} + yb} + 2 =0 (mod q). 
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(b) = (c) We prove the statement by induction on n. It is true 
for n = 0 and we assume it true for some m > 0. Let 2, Ym, Zm be 
integers, not multiples of g, such that x? +y? +2? =0 (mod q™*?). 
Hence x?, + y?, + z?, = r’q™* where r’ is an integer. Since g | zm, 
there is an integer r satisfying the congruence rpz?~' = r’ (mod q). 
Leptin = ta Ved = ye and She) See Sts Then 4 = 
(2m — rg™tt)? = 2? — pzP-trg™t! (mod q*?) and 224, + yaa t+ 
a oe eh pee gett ea (n= pee gs ae 0 
(mod q’”*?). 

(c) = (a) The sequences of integers (%n)n>0, (Yn)n>0. (Zn)n>0 
are q-adically convergent, since 2,4; = », (mod q”*!), Yn41 = Yn 
(mod g”"*"), 2n41 = Zn (mod q"*') for every n > 0. Let a = limz,, 
3 =lim yn, y = limz,. Since x? + y? + z? =0 (mod p"*') then, at 
the limit, a? + 6? +7? =0. 


As in (1D), the conditions of (1E) are also equivalent to: 


(a’) There exist integers x, y, not multiples of g, and a unit y € Lq 
such that 2? + y? +7? = 0. 

We conclude by noting that Fermat equations (for n > 3) provide 
an interesting example where there is a nontrivial solution in every 
q-adic field (by (1A)) and only the trivial solution in integers, as 
proved by Wiles. 
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X.2. Fermat Congruence 


We shall study the congruences 
(2.1) X"™+Y"+4+ 2" 
(2.2) x”"+4+yY" 


0 (mod q), 
Z" (mod q), 


where gq is an odd prime number, n > 0, and gq does not divide n. 
Let Neg) = #3 9,2). | 1 Soe << Ge sa ee 


0 
(mod q)} and N'(n,q) = #{(2,y,2) |1<a,y,z2< qa" +y" = 2" 


z 
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(mod q)}. Clearly, if n is odd then N(n,q) = N’(n,q). In this 
connection, we consider the following problems: 


(1) To determine when N(n,q), N’(n,q) are greater than 0. 
(2) To find upper and lower bounds for N(n,q), N’(n,q). 
(3) If possible, to calculate the values of N(n,q), N’(n, q). 


We recall from Chapter IV, (2A), that if ged(n, q—1) = 1 (hence n 
is odd) then N(n,q) = N’(n,q) > 0. We have also shown in Chapter 
IV, (2D), that if p and gq = 6kp+ 1 are primes then N(p,q) > 0. 

The following implication was proved by Libri (1832, p. 275) and 
again by Pepin (1880), Pellet (1887), and Matthews (1895). 


(2A) Let p be a prime number. If there exist infinitely many 
primes q such that N(p,q) = 0 then Fermat’s last theorem is true 
for the exponent p. 


ProoF. Assume that there exist nonzero integers x,y,z such that 
x? +y? +z? = 0. If q is any prime number such that gq > max{|z}, ly], 
|z|} then 2? + y? + z? =0 (mod q) and N(p,q) > 0. So N(p,q) = 0 
for only finitely many primes q, proving the statement. 


This result shifts the proof of Fermat’s last theorem to the proof 
that N(p,q) = 0 for infinitely many moduli g. The fact is that we 
shall actually prove the opposite, namely for every p there exists a 
prime qo(p) such that if g > qo(p) then N(p,q) > 0. 

Before proving this theorem, we describe some of the numerous 
special results concerning these congruences. 

Legendre (1830) showed that N(3,7) = N(3,13) = 0 and N(5,q) 
= 0 for g = 11,41, 71, 101. 

Libri wrote a series of papers (1824, 1832) in which he exposed a 
method to compute the number of solutions of very general congru- 
ences. Libri calculated N(3,q) for many primes q = 1 (mod 3) and 
showed that there exists qo(3) such that if g > qo(3) then N(3,q) > 0; 
it should be noted that some of his calculated values were incorrect. 
These results were published again by Pepin in 1880 (see also his 
paper of 1876). 

Since g = 1 (mod 3) then —3 is not a square modulo gq and there 
exist integers 1,m of the same parity such that 4q = 1? + 3m?. This 
may be explained by considering the decomposition of g as a product 
of elements in the field Q(,/—3). From the fact that —3 is not a 
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square modulo q then q = aa’ where a = (1+ m/—3)/2, a’ = 
(1 — m\/—3)/2 with l,m integers, both even, or both odd, so 4q = 
l? + 3m?. We choose a representation with minimal |/|. We note 
that | is not a multiple of 3; by changing / into —1 (if necessary) we 
may assume that / = 1 (mod 3) and this implies that | is uniquely 
defined with the above property. Pepin showed: 


(2.3) N(3,q) = (¢—-1)(q—-8+ 1). 


Here are some numerical examples: 


28 =14+3x3% = N(3,7) =6(7—-8+1)=0; 

52 =524+3x32 = N(3,13) =12(13-8—5) =0; 

76 =1743x5? = N(3,19) =18(19-8 +1) =216; 

124=4+43x6? + N(3,31) =30(31—8+4) =810. 
N( ( 


Pepin noted that since | > —2,/q then N(3,q) > /@(./g — 2) — 
Hence if g > 19 then N(3,q) > 0. 

Libri established that for every prime p there exists qo(p) such 
that if g > qo(p) then N(p,q) > 0; however, he gave no bounds for 
N(p,q) or a way of computing qo(p). Pellet used another method to 
show the same result in 1887; in a later note (1911), Pellet gave a 
bound for N(p,q), but his value was erroneous. 

In 1837, Lebesgue considered arbitrary polynomial congruences 


(2.4) F(X,...,Xn) =0 (mod gq), 
where s > 2, q is an odd prime, and F € Z[X,... , X;]. Let 


N Silane. 482) | lS ae sg 
for alli =1,...,s, such that F(21,...,2,) =0 (mod g)}, 

No = FE Gees ay) | 0O< 2, <q-l 
for alli =1,...,s, such that F(21,...,2,) =0 (mod g)}. 
In the polynomial F%~' let A (respectively, Ag) be the sum of the 
coefficients of all monomials eX;1 --- X£* such that q—1 divides each 


e; (respectively, each e; is greater than 0 and divisible by q— 1). 
Lebesgue showed that N, No satisfy the congruences 


No = (-1)**" Ap (mod q), 
N = (-1)°(1— A) (mod q). 
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Lebesgue applied his method to congruences like 
(2.5) AX? +--+ +A,X? =0 (mod q), 


with n > 2, s > 2, each A; is a nonzero integer, g is an odd prime, 
q = 1 (mod n). He gave an expression for the number of solutions 
N in terms of the periods of the cyclotomic equation. Lebesgue 
studied in detail the following special cases of the congruence (2.5): 
$= 2, n= 25 = 30 = 365 —3, 0 = 4. For 2.1) witha —3; he 
derived once more some of the results of Libri. 

Further results about the number of solutions of (2.5) appeared in 
Lebesgue’s paper of 1838. In 1909 two papers by Dickson appeared, 
and one each by Cornacchia and Hurwitz, dealing with these con- 
gruences. 

Cornacchia studied various special cases of (2.1) and (2.5) in detail 
and gave many explicit results, which had been in part previously 
indicated by Lebesgue, Pepin, and Pellet: 


(a) If nm = 2 and g=1 (mod 4) then 


9 
—— when 2 is a square modulo gq, 
N'(2, q) = 5 
—— otherwise. 


7 
—— when 2 is a square modulo gq, 
Nrag=\ %, 
—— otherwise. 


(b) For n = 3, q = 1 (mod 3), Cornacchia obtained once more 
Pepin’s result and showed that if N(3,q) = 0 then g = 7,13. 

(c) If n = 4 and q = 1 (mod 4), Cornacchia calculated N(4,q). 
Moreover, he showed that N’(4,q) = 0 exactly when gq = 
11,17, 29, 41. 

(d) If n = 6 and gq = 1 (mod 6) then N’(6,q) was also deter- 
mined. Moreover, N’(6,q) = 0 exactly when g = 7, 13,19, 43, 
61,97, 157,277. On the other hand, for the congruence X° + 
Y® + Z° = 0 (mod q) we have N(6,q) = 0 exactly when 
q = 7,13, 31,61, 67, 79, 97, 139, 157, 223, 277. 
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(e) If nm = 8 and g=1 (mod 8) then Cornacchia gave upper and 
lower estimates for N’(8,q). Moreover, N’(8,q) = 0 exactly 
when gq = 17,41,113. And for the congruence X*® + Y® + 
Z*® = 0 (mod q), he established that N(8,q) = 0 exactly 
when qg = 17, 41, 113, 137, 233, 761. 


In his papers, Dickson dealt with the congruence (2.1). His method, 
involving the periods of cyclotomic equations, led to lower and upper 
bounds for N(p,q), as well as an upper bound for qo(p), namely 


qo(p) < (p —1)?(p — 2)? + 6p — 2. 


As applications, Dickson showed that N(5,q) = 0 exactly when gq = 
11,41, 71,101 and that N(7,q) = 0 exactly when g = 29, 71, 113, 491. 
Using computations of Carey (1893) for squares and products of 
periods, Dickson applied his method to the congruence X*+Y+* = Z? 
(mod q). 

Hurwitz’s paper dealt with the more general congruence (2.5) with 
n = paprime. He considered the family of such congruences. For all 
possible values of the coefficients and indicated relations which must 
be satisfied by the numbers of solutions of these various congruences. 
From this information, Hurwitz deduced upper and lower bounds for 
the number of solutions of 


(2.6) AX? + BY? + CZ? =0 (mod q), 


where A,B,C are nonzero integers. He also determined a positive 
number qo(p) (depending on equation (2.6)) such that if g > qo(p) 
then (2.6) has a solution (x,y,z), with 1 <2,y,z<q-1. 

In 1917, Schur gave a proof that for every n > 2, ifq > (n!)e+1 
then X" + Y" + Z” = 0 (mod q) has a solution (x,y,z) with 1 < 
x,Y,z2 <q-—1. Schur’s proof was based on the following interesting 
combinatorial lemma: 


LEMMA 2.1. Letn > 1 and N > (n!)e +1. If the set of numbers 
{1,2,...,N} is partitioned into n disjoint subsets L,,..., Ln, there 
exists at least one subset L; such that if m,m’ € L; with m < m’ 
then m’—me L,. 


Among further developments, we want to report that the equa- 
tions 


AX’ + BY! +CZI =0 
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and 
Ay Xy + Ag X53? +--- + AXP" =0, 


with coefficients in a finite field with g¢ elements (d > 1) and not 
necessarily equal exponents, have been the object of numerous pa- 
pers. See Mitchell (1917), Vandiver (1944, 1945, 1946, 1947, 1948, 
1949, 1954, 1955, 1956, 1959), Hua and Vandiver (1948, 1949), E. 
Lehmer and Vandiver (1957). For the vast, more recent, literature, 
the reader should consult the appropriate sections in Mathematical 
Reviews. 

In his paper of 1949, Weil traveled a historical panorama of the 
evolution of Gauss’ original method to deal, by means of Gaussian 
sums, with the congruence AX* — BY? = 1 (mod q), where q is a 
prime, g = 1 (mod 3). These ideas were applied subsequently to 
wide classes of congruences. They were used by Hardy and Little- 
wood in connection with Waring’s problem; Hasse expressed rela- 
tions between Riemann’s hypothesis for function fields and various 
kinds of exponential sums, and Weil published striking and definitive 
results on this question (1928). 

Here we single out the following specific result (see Vandiver, 1946, 
pp. 47-52; Hua and Vandiver, 1948, pp. 258-263), which is more 
directly related with our subject matter. 

Let s > 1, let Aj,...,A, be nonzero integers, let n,,...,n5 be 
integers, let q be an odd prime, and let d; = gced(q — 1,|n;|) > 1 
fori = 1,...,s. Let N denote the number of solutions in integers 
(a,...,%5) with 1 <a; <q-—1 (fori =1,...,8), of the congruence 
(2.7) AXP +++: +A,X>* =0 (mod q). 

Then 
(Ca ae 
q 
In particular, there exists a positive number qo such that if g > qo 
then N > 0. Another proof of this theorem, using the theory of 
group characters, was given by Feit (1967). 

After this survey of results which are related to Fermat’s congru- 

ence, we shall give the proof of Dickson’s theorem. For this purpose, 


we recapitulate the following notation and facts from Chapter IX. 
Let p > 3, gq = 2kp +1 be prime numbers. Let 


(= 1)" 


decdgi? < N< dyes. 


g be a primitive root modulo p; 
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¢ be a primitive pth root of 1; 
h be a primitive root modulo q; and 
p be a primitive gth root of 1. 


The p periods of 2k terms in Q(p) are 


io = pt ph + ph pee + pher” 
Ly = ph ae prt ae ph p+1 Fh eh: hth pee, 
(2.8) het 


ate ae apy shou 
In Chapter IX, we defined Lagrange resolvents for every 7 = 0,1,..., 
p—l: 


q—-1 
(2.9) ci inds (4) 6 


t=1 


where ind; (t) = s, 0 < s < q—2 whent = h* (mod q). In particular, 
No = (1p) = I ph = 1. 

For easy reference, we recall the following results from Chapter IX, 
(2D), (2E), (2F): For j =1,...,p—1, 
(2.10) Ajrp-j = 
(2.11) Jj? = AyAj = 
(\; denotes the complex-conjugate of 2j;), 


L1S 


(2.12) a Hd, 


We shall give a simplified proof of Dickson’s theorem, following 
Klésgen (1970). First, we give an expression of N(p,q) in terms 
of the periods pi: 


(2B) 


N(p.a) = |(q- 1) + (g— Dp? oa! 
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PROOF. To begin, we note that if x,y,z are integers such that 1 < 
x,y,z <q-—1 then 


= picerty te?) — 0 when 2? + y? + 2? £0 (mod q), 
7 q when x? + y? + 2? =0 (mod q). 
So 
q-1 q-1 
qN (p,q) = a 
x,y,z=1 \t=0 
q- q-1 


1 
_ tx? ty? tz? 
= ( pp p 
t=0 \az,y,z=1 


Ift=h' (mod q) and x = h) (mod q) (where 0 < i, 7 < q— 2) then 


wp = TG ph” = pai. Since pj; = py when j = i (mod p), 
then 
p-1 


aN (p,@) = (q— 1)? + 2k} 0 py}, 


1=0 


because q = 2kp + 1. Therefore 


1 
N(p,q) = : (q—1)? + (q-1)p” si). 


And now, we prove Dickson’s Theorem: 


(2C) We have: 


(1) (@-V[at+1—-3p—- (p- 1)(p— 2),/q] < N(p, 9) 
<(q—-l)[lg+1—3p+ (p— 1)(p- 2),/d. 

(2) Ifq > (p—1)?(p—2)? + 6p — 2 then the congruence (2.1) has 
a nontrivial solution. 
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Proor. (1) By (2B) and (2.11), 


Nip.) = 2 \(g-1)° + @- Dr? Y ia Hy} 


1 see Se are 
= (q ee > Ui+i2+5s Ni AjoAjs 
qd (0: Fossa 
= 
= 3 ga) ‘ i(j1t+jotJs) 
= (Gaye Se Ap wks =e 
J15J2;J3=0 
But 
=1 : ; : 
> CG t32+33) =< { 0 when Jit ja Ja # 0 (mod P); 
p when ji + jo +j3 =0 (mod p). 
Thus 


p-l 


1 
N(p,q) a= q (q-1)°+(¢q-1) 3 Aj AjoAig 
jeddgtisce Goals) 


Since Ay = —1, it follows from (2.9) that the last sum of products 
Vj, Aj2Aj 18 ual to 


p-1 
Nj + 3Ao S| AjAp-j + S = -1— 3q(p—1) +S, 
j=l 
where S' = aes joks Oi1>j2Aj3- Hence 
= 
N(p.4) = Iq - 1)? 1 - 3a(- 1) + 5), 
* N 1 1 
Pd) _ 1 (@ — sqp-+q-+ 8) =¢—-3p+1 +28 
gol 4@ q 
and N(p.@) : 
et —(q+1-3p)| = =5I. 
Gs q 


By (2.10), |A;| = ,/g. We note also that for every 71,1 <j, < 
p—1, there are p — 2 pairs (jo, js), 1 < jo,J3 < p—1, such that 
ji+jo+Js =p or 2p, namely (1,p— ji —1), (2,p— ji —2),..., (p- 
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q= 15.1); and (p—1,p—ji +1), (p—2,p—ji+2),.. tad) (p—ji+l1,p—1). 
Hence |S| < (p — 1)(p — 2)q°/”, so 
N(p,q) 


7a — (q+1—-3p)| < (P-1)@- 2)/a 


and we conclude that 


(q—ll[q+1—3p— (p—1)(p— 2) Va] < N(p,@) 


< (¢-1)[¢+1-3p+ (p— 1)(p - 2) 4]. 
(2) For later use, we prove more generally that if v is an integer, 
v > 0, and if ¢ > (p—1)?(p— 2)? + 2(pv — 1) then g+1-(p—1)(p— 
2),/q — pv = 0. Choosing v = 3 this gives the statement (2), in view 


of (1). 


The inequality 


a? +28 > ar/a? + 48 
(for real numbers a, 3), which is easily verified, implies when a = 
(p—1)(p— 2), @= py —1: 


(p — 1)*(p — 2)? + 2(pv — 1) 
2 (pip 2)4/ (pl p— 2)? + App — 1): 

) 

) 


Let 6 = (p—1)?(p—2)?+4(pv—1) > 0 and consider the polynomials 


fat SoS Vp aL (py 1), 


having discriminant 6. 
It suffices to show that 


a> eevee 


indeed, this implies that f(,/q) = 0, that is, g+1—(p—1)(p—2)./q 
pv > 0. We have 
4q > 4[(p — 1)?(p — 2)? + 2(pv — 1)] 
2[(p — 1)? (p — 2)? + (pv — 1) + (p— 1)(p- 2) V9 
= (p—1)°(p— 2)? +6 + 2(p— 1)(p— 2) V6 
= [(p— 1)(p- 2) + V6), 
so /q > ((p — 1)(p— 2) + V6) /2, as was required to show. 


IV 
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Taking v = 3, it follows from (1) that 


N(p,q) > (¢-1)[(¢ +1) — 8p — (p — 1) (p — 2)/q] = 0. 


The upper bounds for qo(p) given by Dickson are not sharp, as we 
see by explicit computation: 


Dickson bound Actual value 
qo(3) < 20 go(3) = 13 
qo(5) < 172 qo(5) = 101 
qo(7) < 940 — qo(7) = 491 


As a complement, Mantel showed in 1916 that if N(p,q) = 0 then 
q must be of the form g = 6mp/(p — 3) — 1 (for some integer m). 

Taking into account (2C) and the result of Chapter IV, (2B), if 
k>1, q=2kp+1 isa prime and q > (p—1)?(p — 2)? + 6p — 2 then 
q divides the Wendt determinant W2,. Therefore, for each prime p 
there exist at most finitely many integers k > 1 such that gq = 2kp+1 
is a prime not dividing W.2,. What is not known is whether, for every 
prime p, there exists actually one prime gq with the above property 
(see Flye Sainte-Marie, 1890, and Landau, 1913). 
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X.3. Hurwitz Congruence 


In this section we give the theorem of Hurwitz concerning the con- 
gruence 


(3.1) A,X? +---+ A,X? =0 (mod q), 

where p > 3, q = 2kp+ 1 are prime numbers and A;,...,A, are 
nonzero integers. Let N = N(Aj,...,As,p,q) be the number of 
nontrivial solutions of (3.1), ie., of (a1,...,25), with 1 <a; < q-1, 


such that S>;_, Aiv? = 0 (mod gq). If h is a primitive root modulo 


X.3. Hurwitz Congruence 305 


q, let a; = ind)(A;) (with 0 < a; < q— 2), so A; = h% (mod q), for 
a Renee 

Thus N is equal to the number of (t;,... ,f;), 0 << t; < q—2, such 
that )7j_, he’ +t =0 (mod q). 

We consider the function y : Z — {0,1} defined by 


_ f 1 when q|z 
Oe when g Jz. 


Then 


(3.2) aes ie), 


t1,... ts=0 i=1 


Noting that if t; = t/ (mod 2k) then APite = hPtite (mod q); so 
(3.2) may be rewritten as 


(3.3) N=p* 2 x (> prem 


tise =0 w=1 
For convenience, we introduce the following “symbol”: 


2k-1 


(3.4) [seed = aR pS es), 


ts=0 


which is a nonnegative rational number. We may rewrite (3.3) as 
follows: 


(3.5) N =2kP aj; 22: 50) = (GQ 1)0" “eh, e505); 


and to determine N we are led to study the symbol [a1,... , a,]. 
First we note that [a,] = 0, since g /h?" 1, for every t1,0<t, < 
q—2. 


LEMMA 3.1. 
[ je 1 when a; = az (mod p), 
1742) ) 0 when a, # a (mod p). 
PROOF. q divides h?!+% + AP’2+% if and only if hPat = —pPete 


= A\9-1)/2+Pt2+22 (mod q); this is equivalent to pt, +a, = (q—1)/24 
ptz+az (mod q—1). Now if a, = az (mod p), say ag = mp+ay. For 
every tz, 0 < te < 2k—1, let t1,0 < t, < 2k—1, be the unique integer 
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such that t; = k +m-+ t, (mod 2k). Then (q—1)/2 + ptz + ag = 
p(k + tz +m) +a, = pt; + a, (mod 2kp). So 


2k-1 


S- x( (prere Sera 


ty ,t2=0 


[a, a9| = 


2k 


Conversely, if for some ty, 0 < tg < 2k — 1, there exists ¢,, neces- 
sarily unique, such that 0 < t; < 2k—1 and pt; +a, = pta+a2+kp 
(mod q—1) then p(t; —t2—k) = az—a,; (mod 2k), hence p | az2—a1. 
So if aj # a2 (mod p) then y(h?t™ + hP?+) = 0 for all ti, te, so 
(a1, ay] = 0. 


Now we collect some easy facts about [ai, ... , ds]: 
LEMMA 3.2. (1) The value of [a,,... ,as] remains unchanged by 
any permutation of ay,... ,Qs. 
(2) [a1,...,@s] = [a},...,a,] whenever a, = a‘, (mod p),..., 
a; =a’, (mod p). 
(3) [ar tu,...,@;+u] =[ai,..., as] for any integer u. 
PRrooF. (1) This is obvious from the definition of [a1,... , as]. 


(2) Indeed, let a; = pr; + aj and let t; + r; = t, (mod 2k) where 
0<t <2k—1. Then AP“+™) = prt (mod q) and 


aceite a by (Soe) 


«ytg=0 w=1 


2k-1 
1 


Ik, > 2 oe 
uy 2k-1 
BE (Se) etn 


ote =0 11. 


(3) Since h“ 40 (mod gq) then for every t; = 0,1,...,q—2: 


x (>: oe) =¥ (>: ee) , 
i=l i=1 


hence [a, +u,...,@, + u] = [a1,... , as]. 
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LEMMA 3.3. [a1,...,@s] is a nonnegative integer and equal to 
2k-1 
far... yas] = Dx (WPT 4p Aptos tert 4 pee) 
tyre th SEO 
2k—-1 
— Se x(hPiter fevise tp FPtea tee pee): 
tj. st) _,=0 
PROOF. Given t),... ,ts-1, 0 < t; < 2k —1, for every t, let t),..., 


t,_, be such that 0 < t'; < 2k—1 and t; =t, —t, (mod 2k). Noting 
that h?'s #0 (mod q), we have 


[Gisee eg 
1 k- k-1 s 
= OR ps x (= i) 
ts—_1=0t,;=0 i=l 
1 2k—-1 2k—-1 


Se oh S- S- x (APes (ht tar 4... 4 ppt atest + he) 


2k , 
tj. t)_,=0 ts=0 
1 2k-1 
= —2k pptitar 4 pptatar 4... 4 _Ptiatasa 4 pas), 
ae DB x( + feet +h) 
UF, easth 40 
Hence [a,,... , as] is a nonnegative integer. For the last equality, we 


note that a, = a, + (q—1)/2 (mod p) and h'%-)/? = —1 (mod q). 
Then, using Lemma 3.2 and what we have just proved, 


qd—- 
Ce remerere | fas. ,As—1, A, + ——— 


2 
2k-1 
= S- x (AP te foieee fb APbs-1tOs-1 _ he): 
ti... 81, =0 
LEMMA 3.4. If r,s >1 then 
laayceex > As, by,..- 5 b,.| 
p-1 
= 2klay,... 5 as|[b1,--- 5 b,.] + 5 [ai,... 5 as, C|[bi,... sige]: 
c=0 


Proor. Let A; =h™ (mod gq), B; = h (mod q). Then 
Dh aie ses Ges Dis cea ysDel UNG 
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the number of solutions (x1,..., %s,Y1,---, yr) with 1 < a;,y; < 
q—1of i, AX? +5, BY? =0 (mod q). Let N’ be the number 
of solutions (41,..., £5), with 1 < 4; < q—1 of 0)_, A:X? = 0 


(mod q) and let N” be the number of similar solutions of }7_, By Y/’ 
= 0 (mod q). 

For every d = 0,1,...,q@—2 let Ni be the number of solutions 
(@,.-.,%5), 1 <a; <q-1, of 


SS A,X? +h’ =0 (mod gq) 


i=l 


and let N/ be the number of solutions (y1,..., y-), 1<y; <q-1, 
of 


S° BY?p —h?=0 (mod q). 
j=l 


Then I= Dh hs wevic Ws) AVC = Qhp" (Biya 41 Del 
2k-1 
ee 2 x (Some +n) =p" |Gys.c.3 7 Og; al 
. t4=0 i=1 
(as follows from Lemma 3.3), 
2k—1 
NED i xX Sonnets — pl =p'|a,... , ay, dl 
,t=0 j=l 


(as follows from Lemma 3.3). 
We have N = N’N” + 374-5 NN‘. But 


[Bissae g Beg) = lee sc. oe ee|y (Binwor 4 ied) biners 5 Ory), 


when d= c (mod p). Therefore 


2kp**"[ar,.-. » As, bi, ees , b,| 
SAR pl lag ces eagl [Dia es 32D, | 
p-l 
+2kp" S “lar, sheng gy ClO ty ave ates el 
c=0 


and dividing by 2kp**", we have the required relation. 
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LEMMA 3.5. For any integers aj, G2, C: 
p-l 
S “Jai, a2,¢+ d] = 2k — [ay, ao]; 


d=0 


in particular, the above sum is independent of c. 


PROOF. 
p-l p-1 
S [ai,a2,c+d] = leks 1, G9] 
d=0 


d= 
p- 2k-1 
aap i DOP aie alee mm) : 


d=0 \t,u=0 


by Lemma 3.3. Let U, be the set of all u,0 <u < 2k —1, such 
that h?“t™ + h* = 0 (mod q). Then the number of elements of 
Uy is #U, = 2*' v (bette + h%) = [a1,a2], by Lemma 3.3. Let 
Uz be the set of all u,0 < u < 2k —1, not belonging to U;’, so 
#U, = 2k — [a;, a2]. We may write 


p-1 p—-12k-1 


Ddiler,aa,c+d] = Dy Dd) Dd x(nrrere + nenrm + he) 
d=0 ucU; d=0 t=0 
p-12k—1 


if Se +S os Tee are afi puta 4 AP?) 


u€U2 d=0 t=0 


If u € U,; then y(hPttete + Apert + p22) = y(hP'tete) = 0. If u € Un, 
there exists exactly one couple (d, t) such that h?téte+hPera+he = 
0 (mod q). So S22 )[a1, a2,¢+ d] = #U, = 2k — [ay, aX]. 


LEMMA 3.6. For any integers a1, G2, 03, Q4: 


p-l 
Sola + d, a2 + d, a3, a4] 


d=0 


= (q— 1)[a1, a2][as, a4] + (2k — [a1, ae])(2k — [a3, a4). 
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ProoF. By Lemma 3.4 we have 


d=0 d=0 
p-l 
a y [a, + d, az + d, cl[az, aa, c] 


d,c=0 


p-1 Deel: 
S [ai + d, az + d, a3, a4] = 2k ( js +o +d) [a3, G4] 


= 2kplay, a2] [a3, a4] 


p-l p-l 
a (S tenant) (Stes ei 

d’=0 c=0 
(where d’ = c—d (mod p),0 < d’ < p—1), by virtue of Lemma 
3.2. The above sum is therefore equal to (q—1){a1, a2] [a3, a4] + (2k — 
(a1, @2])(2k — [a3, a4]), by Lemma 3.5. 


For any integers n,m let 
p-1 
(3.6) Oe Sela m+ nd, 0]. 
d=0 
We have: 


LEMMA 3.7. Qom = 2k — [m,0], Q1m = 2k — [m,0], Qnym = Oni m! 
when n =n! (mod p),m=m! (mod p) and 7?) Onm = 4-2. 


m 


PRooF. By Lemmas 3.2 and 3.5, 


p-1 p-l 
dom = D[d,m, 0] = S7[m, 0, d] = 2k — [m, 0}. 
d=0 d=0 
Similarly, 
p-l p-l p-1 
Aim = So[d,m + d, 0] = S—[0,m, d) = S"[m,0, d) = 2k — [m, 0) 
d=0 d’=0 d’=0 
(where d’ = —d (mod p),0 < d’ < p—1). From Lemma 3.2 it 


follows that Qn» depends only on the classes of n,m modulo p. 
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Finally, using Lemma 3.5, 


p-1 p—1 p-1 
ye Anam = > [d,m + nd, 0] 
m=0 m=0 d=0 
p—-1 p-1 
a S°[d,0,m + nd] 
d=0 m=0 
p-l1 


(2k — [d, 0]) 


I 
M4 


d=0 
= 2kp -—1=q-2. 


Next we consider the expressions (for any integers n,m) 


(3.7) 
p-1 
on a aoe e Bear Te S- [c,d + ne, O][e,d + m + ne, 0]. 
d,c,e=0 
We have: 
LEMMA 3.8. 
p-1 
Onm = Y{(a- 1[(n— Lj +m, O][nj +m, 0] 
j=0 


+ (2k — [(n —1)7 + m,0])(2k — [nj + m,0])} — 2k, 
and if n #0,1 (mod p) then 


_ f 2k(q-—4)+¢q when m=0 (mod p), 
aaa 2k(q —4) when m #0 (mod p). 


PROOF. 


p-1 p—1 
Cr = ( [d+ neollesd-+ m+ neo) 


p-1 
= ( [c — ne, —ne, d][e — ne — m, —ne ind). 
c,e=0 \d=0 
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By Lemma 3.4 the above sum is equal to 
p-l1 


S- ({e nc, —ne,e —ne—m,—ne— ml] 


c,e=0 


p-1l /p-1 
S- (Ste— ne tne kim, net ne +m, e.0) — 2k. 


e=0 \c=0 


Let 7 = e—c (for any fixed e), then c— nc + ne+m = (n—1)j9 + 
m+e,—nce+ne+m=nj+m. Hence the sum is equal to 


E [Elo—ni +m enema} ~ 


j=0 


p—-1 (p-1 
= {Se — ays +m + eesng-+m.0)h — 2 


e=0 
and according to Lemma 3.6 the sum is equal to 


p-l 


dD {q-I[(n — 1)j +m, O] [nj +m, 0] 


: + (2k — [(n — 1)j + m,0])(2k — [nj + m, 0])} — 2k. 
If n 40,1 (mod p) and if m= 0 (mod p) then 


anm = YA(g— Vlln— 1), 0]ln4.0 


+ (2k — [(n — 1)j, 0]) (2k — [nj, O]) } — 2k 
= (q—1) + 4pk? —2k-—2k+1-2k 
= q+2k(q—1)— 6k 
q+ 2k(q— 4). 
If n £0,1 (mod p) and m #0 (mod p) then a similar computa- 


tion gives 


Onm = 4(p — 2) + 4k(2k — 1) — 2k = 2k(q — 4). 
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LEMMA 3.9. If n # 0,1 (mod p), for every integer m we have the 
inequalities 


qg—2-—(p—1)/¢ < Donm <q—2+(p-1)V¢. 
PROOF. We have seen that 


Ono =O go tan t-:+a%,, = 2k(q—-4)+4¢ 


spl 
and if r #0 (mod p) then 

On = An 0Cn,r + An 1Anr+1 HPht ste Qn,p—-1An,r+p—-1 = 2k(q [= 4). 
Then 


(Qn. SS Ging)? a (Qn — Ole een) St ee (Ojo J 2 nen a) 


p-l1 p—1 
2 (Ss “.) —2 (s netic) 
c=0 c=0 


= 2(2k(q — 4) + q] — 4k(q — 4) = 2¢. 


But then 


2q > (Os oi = Oech) a f Obese aof are Chee 


D) ( Qn,m+r + fae) Soe = ee 
An,m 
: 2 2 


2 
Qn,m+r + Qn,m—r 
2{ An.m : 
¢ | 2 


IV 


Since \/q is irrational, we have the strict inequalities 
On.m+r + QAn,m—r 
al: > Gin 5 > —/¢. 


The above inequalities hold for every r = 1,2,...,p—1. Adding 
them up, and noting that 


p-1 pol 
Q m+r a Qn,m—r \ Qn,m+r as An,m—r 
2 D = One + De D 
r=1 r=0 
and that 
p—l pl 
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(by Lemma 3.7), then 
(p— IVa > (p— Vanm — (—Anm + 9-2) > —(p— 1) V4 


hence 


q—2-(p—1)V/@ < ponm <q—2+ (p— 1). 


After these lemmas, we turn to the consideration of the congruence 
(3.8) AX? + BY? + CZ? =0 (mod gq) 


(with A,B,C nonzero integers). Let A = h* (mod q), B = h? 
(mod q),C = h*° (mod q), with 0 < a,b,c < q—2. With these 
notations, we have: 


(3A) 
p-l 
pla, b, c| = 6k+2—- q-Vvur+ S- Qn,b—c—n(a—c)) 
n=2 
where 


v = [a,)] + [b, ¢] + [c, a] 
0 when a,b,c are pairwise incongruent modulo p, 
| 3 when a,b,c are congruent to each other, 
1 when two of a,b,c are congruent, but not the other one. 


PROOF. We shall compute the sum S$ = S7?— Ges d—ne Where d,e are 
integers. By definition, 
—1lp-1 
s= earn —ne+nj, 0]. 
n=0 j=0 
If 7 =e (mod p), then for every n we have d—ne+nj =d (mod p). 
If 7 #e (mod p) then {d—ne+ nj |n=0,1,...,p—1} is a set of 
pairwise incongruent integers modulo p. Therefore 
—lp-1 
S = ple,d,0] + Sy t0 
t=0 j=0 
p-l 


= ple, d, 0] a + (2k _ i, 0}), 


j=0 
j#e (mod p) 
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by Lemma 3.5. So 


S = ple,d,0|] + 2k(p — 1) + [e, 0] — SE 0] 


j=0 
= ple, d,0] + 2kp — 2k + [e,0] — 1 
ple, d, 0] 22 [e, 0] Tg 2 — 2k. 


But a0,4 = 2k —[d,0] and ay 4_. = 2k — [d—e, 0] = 2k — [d, e], hence 


p—l1 
ple, d, 0] =, S- On,d—ne + 6k — (q = 2) = {(d, 0] t [d, e| + [e, O}}. 
n=2 
Let e = a—c, d= b—<c; then pla, b, c] = pla—c, b—c, 0] = ple, d, 0] = 
ey Cao ated + ORG 2) 


After these preliminaries, we may prove the theorem of Hurwitz 
(1909): 


(3B) The number N of solutions (x,y,z), with1 <a,y,z<q-1 
of the congruence (3.8) satisfies the inequalities: 
(q-I[(q+1)- @-1)~-2)Va- py] < N 
< (q-I[(qa+1) + (p-1)(p- 2)Va — pv). 


PRooF. By Lemma 3.9 we have 
(p-2)[a-2-(P-1) Va] < PY ong o-nta-o) < (P-2)la-2+(9-1) V4], 
hence by (3A) we have 
p(6k + 2—q-—v)+(p— 2)[(q- 2) —- (p— Iva 
< p’[a,b, c] < p(6k+2—q—v)+(p—2)[(q—2)+ ~-l va, 
that is, 
qt+1—vp—(p—2)(p—1) Va < p*[a,6, ¢] 
<q+1l—vp+(p—2)(p—V v4. 
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But by (3.5), N = (q —1)p?|a, b,c], hence 


[ 
(q—-1)[(¢+ 1) - (p-1)(p- 2) a - pv] 
<N <(q—-V[(q+1)+ (P— 1)(p— 2)./¢ — pv. 


To guarantee that the congruence (3.8) has a solution, it suffices 
to show that q¢+1—(p—1)(p— 2),/q— pv = 0. 


(83C) Ifq>(p—1)?(p— 2)? +2(py —1) then N > 0. 


PRooF. It was shown in (2) of (2C) that if g > (p— 1)?(p — 2)? + 
2(pv — 1) then q > (p—1)(p— 2),/q+ pv — 1. It follows from (3B) 
that N > 0. 
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X.4. Fermat’s Congruence Modulo a Prime-Power 


We shall consider in this section the congruence 
xP" 4 yP™ 4 7" (mod p"), 


where p is an odd prime number and n > m > 1. According to (1B) 
we may restrict our attention to the congruence 


(4.1) xP" 4 ye” + ZP" =0 (mod p™*?), 


We have already begun the study of this congruence in Chapter 
VI, 81. We recall (Chapter VI, (1H)) that there exist integers xz, y, z 
not multiples of p, satisfying the congruence (4.1) if and only if there 
exists a, 1 <a < (p—3)/2such that 1+a?” = (1+a)?” (mod p™*?), 

More generally, let k > 3, m > 1 and p be an odd prime. We wish 
to study the congruence 


(4.2) KES AE a age Pest ed pera): 
A (nontrivial) solution is a k-tuple (a1, %2,... :p) of integers, 1 < 
a, <p™ 1-1, px; (for alli=1,... ,k), oP +a3”> +---+22" =0 
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(mod p™*!). Two solutions (x1,22,...,@,) and (y1,Y2,--- , YR) are 
said to be equivalent if there exists some integer a, not a multiple 
of p, 1 <a< p™t! —1, and a permutation 7 of {1,2,... ,k}, such 


that y; = azz) (mod p™t') for i = 1,...,k. This is clearly an 
equivalence relation in the set of solutions. 

For each integer a let @ = a (mod p™*!) and let (Z/p™*'!)* be the 
multiplicative group of invertible residue classes modulo p”*!. Let 


U = U(p™™") 
= {b| there exists a, prime to p and such that 6 = a?” } 


and let 
V=V(p™t') = {b|b=1 (mod p)}. 


As is well known, U,V are subgroups of (Z/p™*'), U has p— 1 
elements, V has p” elements, and 


(4.3) (Z/p"™")* SU xV 


(see any standard text on elementary number theory, or Ribenboim’s 
book on algebraic numbers, 1999). 

Let AU =A of ae fort 1s hy for every he > 1: 
Then the congruence (4.2) has a solution if and only if 0 € kU. 

For example, let m = 1. If p = 3 then U = {1,8}, and it is easy 
to verify that 0 ¢ 3U,5U,7U, but 0 € AU for all h £1,3,5, 7. 

We deduce that a cube cannot be a sum of two, four, or six cubes, 
if these numbers are not divisible by 3. Otherwise, if, for example, 
y? = o_, #3 then (—y)?+3°°_, 23 = 0, hence also —y? + -°_, =? =0 
(with —y, %;, belonging to U), which is a contradiction. 

Similarly, if p = 5 then U = {1,7,18,24}. By computation we 
verify that 0 ¢ U,3U,5U, but 0 € AU for allh 41,3,5. Thus, a fifth 
power cannot be a sum of two or four fifth powers, if these numbers 
are not divisible by 5. 

Let g be a primitive root modulo p, 1 < g < p, and let r = g?” 
(mod p™*"), 1 <r < p™*'. Then 1,7,r?,r°,...,r?-? are pairwise 
incongruent modulo p™*1, and U = {1,7,7’,7°,... ,7?-*}. In other 
words, given g, every element 7” € U is uniquely equal to some 
power 7, with 0 < i < p—2. So every solution of (4.2) corre- 
sponds bijectively to a representation of 0 as a sum of powers of 7 in 
(Z/p™*")*, that is, to a congruence 


pitpe...t pie =O (mod pa). 
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with 0 <i, < p—2 (fort=1,...,k). 

Two such representations (r”,... , 7) and (r1,... , 77*) of 0 are 
said to be equivalent when the corresponding solutions of (4.2) are 
equivalent. Explicitly, there is a permutation a of {1,2,... ,k} and 
an integer h, 0 <h < p—2 such that i, = jr) + h (mod p— 1) for 
CHA yen. y'k 

A representation (r,... ,7“*) of 0 is normalized when i; = 0 < 
dg Sets < 4 < p— 2. Every representation is equivalent to a 
normalized representation, as easily seen. However, as we shall see, 
an equivalence class of representations of 0 may contain more than 
one normalized representation of 0. 

A cyclic solution of (4.2) is a solution (11, %2,... ,@,) where 7; =1 
(mod p™t'), «; = a/—' (mod p™*") for j = 2,... ,k for some integer 
a, not multiple of p. Each cyclic solution corresponds to a cyclic 
representation of 0, which is a representation (relative to a given 
primitive root modulo p) of the form 


(4.4) Lp pr cede -Ot = 0. (mod p™*) 


(for some i, 0 <i < p—2). 


(4A) If p = 1 (mod k) then there is a cyclic representation, 
namely taking i = (p —1)/k. 


PROOF. 


(1t+rit---tr@D) 1 —r) =1-—r" =1-r?-1=0 (mod p™*?). 


But r = g?” (mod p™*!), so r = g (mod p), hence r is a prim- 
itive root modulo p. If r‘ = 1 (mod p) then p — 1 divides i = 
(p — 1)/k so k = 1, contrary to the hypothesis. This shows that 
(1 — r’) (mod p™**) is invertible, hence 1+ r? +--+» + r@-Dt = OQ 
(mod p™*"),. 


In the particular case where k = 3 and m = 1 we have the cyclic 
representation 1+ r'+r? = 0 (mod p*) where i = (p—1)/3, r= g? 
(mod p?). So rt = (g®-0/3)P (mod p) and (r*)? = g®-)P = 1 
(mod p?), thus r’ is a cubic root of 1 mod p?. 

The following criterion for Fermat’s theorem relies on the existence 
of a representation which is not cyclic (see Klésgen, 1970): 
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(4B) Ifm>1, p is an odd prime and if there exist integers x, y, Z, 
such that p xyz and x?” + yP™ + 2" = 0, then 0 has a noncyclic 
representation modulo p>™*?, 


PROOF. We have 
ere Gr Pair =, 
hence by the result of Pollaczek, quoted in Chapter VI, (2S), we have 


+ (ay)? + y??""" £0 (mod p) 


m— 


ae 
and therefore 
x? +ary+y? #0 (mod p). 

Let w be an integer such that wx = y (mod p). So 


1t+wt+w*?Z#0 (mod p). 


From x?” + y?” +z?” = 0 we deduce that x?” + y?”" + 2°" =0 
(mod pt) (see Chapter VI, (1M)). We have also x + y+ z= 0 
(mod p). So z= —(a+y) = —x(1+w) (mod p) and therefore 


3 


2 oP (1+) 


(mod fala 


We deduce from the hypothesis that 
a (1 tw?” —(14w)")=0 (mod p*”*4), 
so 
1+w?” —(1+w)?” =0 (mod p?"*), 
If —(1+w)?” = w”’”" (mod p+!) then —(1 + w) = w? (mod p), 
hence 1+ w+ w? = 0 (mod p), which is a contradiction. 
If w?” = (1+ w)?”" (mod p+!) then w = (1+ w)? (mod p), 


sol+w+w? = 0 (mod p), again a contradiction. Thus we have 
obtained a noncyclic representation of 0 modulo p?™*?. 


Thus, for example, if p= 1 (mod 3), if the only representation of 
0 modulo p?* is the cyclic representation 


l+rt¢r% = (mod p*), 


where g is a primitive root modulo p, and r = g? (mod p*), then 
the first case of Fermat’s theorem holds for the exponent p. 
Following Klosgen, and keeping the preceding notation, we show: 
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(4C) (1) Jf1l4+r?+r? =0 (mod p™*") withl<i<j<p-2, 
then the normalized representations equivalent to the above one are: 

(Ri) 1+r'+r? =0 (mod p™*"); 

(Ro) 1+r3-*+r?-!* =0 (mod p™*"); and 

(R3) l+r?-h49 + rP-}-J** = 0 (mod p™*"). 
If 7 = 20 then the representations (R,), (Rz), (Rs) coincide. If 7 # 
2i, the representations are distinct. 

(2) If1+1+r? =0 (mod p™*"), then 1 < j and the normalized 
representations equivalent to this one are: 

(Ri) 1+1+4+7% =0 (mod p”*"'); and 

(RS) 1+ r?-t-9 + r?-!-49 = 0 (mod p™*"). 
In this case the representations (R',), (R) are distinct and 2?" = 2 
(mod p™*?), 


PRooF. (1) From (R;) we obtain (R2) by multiplying with r?~!~*: 


pP ttt pPHl 4 Pl td = (mod p™*"), 


so 1+ri~'+r?-*~* = 0 (mod p™*"), and this is a normalized repre- 
sentation (because 1 < j —i < p—1—i< p—2) which is equivalent 
to (R;). 

In the same way, we see that (R3) is equivalent to (R2). If r” + 
ret? 4+ ht) = (0) (mod p™*") is an equivalent representation which is 
normalized then one of the three cases must happen: 

(a) h =0 (mod p-— 1), which yields (R,); 
(b) h4+72=0 (mod p—1), which is equivalent to (Rez); and 
(c) h+j =0 (mod p— 1), which is equivalent to (Rs). 

If 7 = 22 then 1 +r? +r” = O0(mod p™'), l+r?+re?+t+=0 
(mod p™t!) and 1 + r?-1-% 4 rP-l-* = 0 (mod p™"). So r* = 
r?-!~ (mod p™*t!). Therefore 2i = p—1-—i (mod p-—1). But 
1 < 2i, p—1—i< p—2 hence 21 = p—1—i, thus i = (p—1)/3. We 
conclude that the representations (R,), (Rz), (Rs) coincide with the 
cyclic representation 1 + r?-)/3 4 p2@-Y/3 = 9 (mod p™*?), 

It remains to see that in all other cases these representations are 
distinct. 

If (R,) and (Rz) coincide then i = j —i, so 7 = 22, contrary to the 
hypothesis. Similarly, if (R,) and (Rs) coincide then i = p—1—J, j = 
p—1—j+iand again 27 = p—1+i7, 27 = 2(p — 1) — 27, hence 
i = (p—1)/3 and j = 2(p — 1)/3 = 22, contrary to the hypothesis. 
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Finally, if (R2) and (R3) coincide, then j -—i = p—1—j, p—1-i= 
p—1— 3+ and once more 2i = j, contrary to the hypothesis. 

(2) If1+1+7? = 0 (mod p™*') then we have the normal- 
ized equivalent representation (R4) (obtained by multiplication with 
rP-1-3), We have also j # 0, since p™t' > 3. So (Rj) is not the 
same representation as (R/,). In this case 2+7r7 = 0 (mod p™*'). Re- 
calling that r= g?” (mod p™*") and noting that g?” = g (mod p), 
then 2+ g/ = 0 (mod p), so g’ = —2 (mod p) since gi?” = —2?” 
(mod p™**), and therefore 2?" = 2 (mod p™*?). 


We may rephrase this result as follows: 


(4D) = The equivalence classes of solutions of 
Le EXP S00. God op) 


consist of six distinct solutions, with the following exceptions: 

(a) p = 1 (mod 6), a # 1 (mod p”) but a? = 1 (mod p?): in 
this case (1,@,a”) and (1,a@’,a) form an equivalence class of 
solutions. 

(b) 2?" = 2 (mod p™*"); in this case (1,1,—2), (1,—2,1) and 
(1, (—2)P" ®-2), (—2)P"-2)) form an equivalence class of so- 


Proor. According to (4C), if p = 1 (mod 6) the given congruence 
admits the cyclic solution; its equivalence class contains exactly two 
solutions (of which one is normalized). If p # 1 (mod 6), there is no 
cyclic solution. If 1+1+7r/ =O (mod p’”*') then there are precisely 
three solutions in this equivalence class (of which two are normalized 
and if this happens then 2?” = 2 (mod p™t'). In all other cases, 
each equivalence class contains exactly six distinct solutions. 


We show now that in certain cases, it is possible to obtain a new 
solution of (4.1) from a given one (Peschl, 1965): 


(4E) If1+r*+r? =0 (mod p™"), where j = 31+ (p—1)/2 
(mod p — 1), then 14+ r# + r@-D/2+5) = Q (mod p™*") and this 
representation is not equivalent to the given one. 
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PROOF. Since r{?-)/2 = —1 (mod p™+!) then 14 r? — r®* = 0 
(mod p™*!). Hence —r? — r#* + r#% = 0 (mod p™") and r7 + 
r — yr = 0 (mod p™*'). Adding these congruences, we obtain 


1+ r% — 7 = 0 (mod p™?), that is, 1 + r# + r@-D/P+s = QO 
(mod p™*?), 

If this normalized representation is equivalent to the given one, 
by (4C) we must have one of the following three cases: 

(a) 4¢ = i (mod p—1) and 5i = 3% (mod p — 1), hence i = 0 
(mod p— 1), soi = 0 and 1+1-—1 = 0 (mod p™*'), a 
contradiction. 

(b) 4¢ = 2+ (p—1)/2 (mod p—1) and (p—1)/2+5i = p—1-i 
(mod p — 1) and this leads, as before, to a contradiction. 

(c) 4¢ = (p—1)/2 — 3% (mod p— 1) and (p—1)/2+ 5i = (p 
1)/2 — 27 (mod p — 1), leading again to a contradiction. 


In a similar way, we have: 


(4F)  Ifp=1 (mod 4) and 
LltreY/44 pi =0 (mod po) 


then 
14+itr%+e-1/4= 9 (mod p™*?), 


and this representation is not equivalent to the given one. 


PROOF. 
LltreD/44 ri =0 (mod ps 
hence 
pe-H/4 4 1 as pi +(p-1)/4 =(0 (mod ger?) 
and 
tpIt@-D/4 _ pi 4 p25+@-Y/4 — 0 (mod oy 


Adding these congruences, we have 

L+1tr%t+@-D/4 = 9 (mod p™"), 
We note that r®-)/4 # +2 (mod p™*"), otherwise -1 = r®-V)/? =4 
(mod p™*'), which is impossible. So 7 4 0 (mod p— 1) and j # 


(p—1)/4 (mod p—1), so the representation obtained is not equivalent 
to the given one. 
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We shall now concentrate more on the study of the number of 
solutions of (4.2). Our method will be similar to the one in §2, in 
connection with Dickson’s theorem. 

We introduce the following notations, where p > 2 is any prime 
number, m > 0, k > 3: Let F(p,m,k) be the number of (21,... , 2x), 
such that 1 <a; <p—1 (fori=1,...,k) and 


ge 4p gh 4. ta?” = 0 (mod p™). 


Let a be any integer, 1 < a < p—1 and let F(p,m,k;a) be the 
number of (2,... ,2,) such that 1 < 2; < p—1 (fori =1,...,k) 
and 

ae + ab +--+. +a?" =ap™ (mod p™*"). 
Let N(p,m, k) be the number of (x2,... , 2,) such that 1 < x; < p—1 
(for i =2,...,k) and 


L+ap +---+a?" =0 (mod p™*?). 


If k = 3,m = 1, we shall simply write F(p) = F(p,1,3), F(p;a) = 
F(p,1,3;a), N(p) = N(p,1,3). 

First we indicate some relations between these various numbers. 
Then we shall derive inductive formulas in terms of certain periods 
of the cyclotomic field and, in turn, in terms of Jacobi cyclotomic 
sums. 

For m = 0 it is easy to compute explicitly: 


(4G) 


In particular, 


F(p,0,3) = (p—1)(p— 2), 
N(p,0,3) = p—2. 


ProoF. If 1 <a; <p—1fori=1,...,k-—2and 2,+---+2,_. #0 
(mod p) then we may choose p — 2 values for x,-1, such that 1 < 
Le—-1 S pl, Hi t+: -+aHp-2+x,~-1 #0 (mod p), and this determines a 
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unique x, 1 < x, < p—1, such that 2,+---+a,-1+2, =0 (mod p). 
So we have already [(p — 1)*~? — F(p,0,k — 2)|(p — 2) solutions. 
Now ifl<a;<p—1fori=1,...,k-—2anda,+---+2,_2=0 
(mod p) then we may choose p — 1 values of x,_; (which determine 
x,) and hence we have F'(p,0,k—2)(p—1) solutions. Thus for k > 3: 
F(p, 0, k) 
= [(p—1)*-* — F(p, 0, k — 2)|(p — 2) + F(p,0,k — 2)(p — 1) 
= (p—1)**(p — 2) + F(p, 0, k — 2). 
In particular, F(p,0,3) = (p — 1)(p — 2), F(p,0,4) = (p- 1)°(p — 
2) + (p—1). From the above relations we obtain 
F(p,0, 2k + 1) = (p ~~ 1)"*""(p oa 2) ae F(p,0, 2k = 1), 
F(p,0, 2k aa 1) — (p oa 1)P*=?(p > 2) ef F(p,0, 2k =~ 3), 


Hence, adding up, 


F(p,0,2k+1) = (p—1)(p— 2) 3 ae — 1) 
= (p—1) : (p 1)2s+1 = S (p- 1)2 
-_ (p—1)?* -1 (p—1)* —1 
= (p “@-Dt1 = (p—1) : 
Similarly, 
F(p,0,2k) = (p~2) -(p— 1)? +1 
= (p —1)2+1 — — (p-W%41 
= (p—1) Lo 1)” = (p— 1)237? 
=(-y eens w-vl 1D aes! 
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Hence, whether & be even or odd, we have 
(p- 11+ (1) 
F 


F(p, 0, k) = (p — 1) 
The same argument gives 


N(p,0, k) om (p— 1 ae ee 2) at N(p,0,k _ 2). 


Thus 
N(p, 0, 2k + 1) = (p — 1)*-?(p — 2) + N(p, 0, 2k — 1) 
and 
5 ‘ 2k 
—1)2* —1 
N(p,0,2k +1) meee ee 
22 P 


Similarly, N(p,0, 2k) = ((p — 1)?*-1+.1)/p, so for any values of k: 
(pe Dey 
5 


N(p,0,k) = 


More generally: 


(4H) With the above notations: 
(2) F(p,m,k;1) = F(p,m,k;2) = --- = F(p,m,k;p — 1). This 
number shall be denoted by F*(p,m,k). 
(3) N(p,m,k) = N(p,m —1,k) — F*(p,m,k) form > 1. In 
particular: N(p,1,k) = ((p —1)*-1 + (—1)*)/p — F*(p,1,k). 


ProoF. (1) We consider the sets F = {(%1,...,2%) | 1 < a < 
p—1fori=1,...,k and a? +---+2a2" = 0 (mod p™")} and 
N = {(ao,..., 2%) |1 <a; <p—1fori=2,...,kand1+a2 + 
--+ a2" =0 (mod p™*)}. If y is any integer, 1 < y < p—1 and 
(t2,..., tr) EN, ify. = y, yi = yxi (mod p) for i = 2,... ,k, then 
(y1, Ya,--- a) € F. Different values of y yield dificignt eolivions of 
Mee es + XP” =0 (mod p™*?), 

TE peas oe c N and (x),...,2,) € NV, with (a,...,2%) F 
(a5,...,24,), ify, y’ are integers such that 1 < y,y’ < p—1, the above 
method leads to distinct solutions (y1, yo,.-- , Yr) A (Yi, Ybs--- 5 Ye) 
because if y 4 y’ then y; ¥ yj, and if y = y’ and say x, # xj, then 
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It is also clear that every (yi, Yyo,.-- ,Yx) € F may be obtained in 
this manner, namely taking x;, 1 < x; < p—1, such that y; = y,2; 
(mod p) for i= 2,...,k. Thus, F(p,m,k) = (p—1)N(p,m,k). 

(2) Let 1 < a,b < p-—1, and let c be such that 6 = ca 
(mod p),1<c< p—1. If (a1, 20,... ,x;,) is such that 1 < x; < p—1 
and 

at +ay +---+a~ =ap™ (mod p™*), 
then letting y; be such that 1 < y; < p—1, y; = ca; (mod p) then 


yp =P" 2?” (mod p™*"). From ce” = c (mod p) we conclude that 


yo tub te tuk =P ap™ = bp™ (mod p™*"). 
In this way we establish a bijection between the sets of solutions of 
xe" sf xe fe ere he xe” =ap” (mod p™*"') 
and of 
KEL KE ss Spy (mod pe: 


Therefore F'\(p,m, k;a) = F (p,m, k; b). 

(3) Let gee es : Lage 2 = 0 (mod p™) with 1 < x; < p—1 (for 
i=1,...,k). Then 2?” +---+2?" =0 (mod p™) since 2?” = 2?” 
(mod p™). Hence there exists a, 0 < a < p—1, such that 

oP + -+- +a?" =ap™ (mod p™*?), 
Thus (21,... ,2,) is a solution of 
De hated, a0) (mod pets) 
or a solution of 
XP" esi XP Sap” (mod p™ 


for some a, 1 < a < p—1; and conversely. 
Hence, by Part (2), 


F (p,m —1,k) = F(p,m,k) + (p—1)F*(p,m,k). 
By Part (1), 
N(p,m—1,k) = N(p,m,k) + F* (p,m, k). 
In the special case where m = 1, by (4F) we have 
Ge tia)" 
Pp 


Np dk) = F*(p; 1k): 
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If p > 3, let 
0 when p=-—l1 (mod 6), 
1 when p=1 (mod 6), 


and . 
when 2?” = 2 (mod p™*'), 


1 
yeh) = { 0 when 2?” 42 (mod p™*?). 


Then we have 


(41) 
y(p,m) + 3y(p,m — 1) (mod 6), 
(p,m) + 26(p) (mod 6). 


F*(p,m,3) = 


3 
N(p,m,3) = 3 


Proor. The solutions of 1+ X?" +X?" =0 (mod p™*!) are orga- 
nized into equivalence classes. By (4D), these classes consist of six 
elements, except when p= 1 (mod 6), where there is a class of only 
two elements, and when 2?” = 2 (mod p™*"), where there is a class 
of three elements. Thus 


N(p,m,3) = 3y(p,m) + 26(p) (mod 6). 
By (4H), 
F*(p,m,3) = N(p,m — 1,3) — N(p,m, 3) 
= 3y(p,m — 1) — 39(p,m) 
= 37(p,m— 1) +3y(p,m) (mod 6). 


For the special case where m = 1, k = 3, we have 
N(p) = 3y(p, 1) + 26(p) (mod 6). 


In order to indicate an upper bound for N(p) (with p > 3) we 
need to study in more detail the Cauchy polynomials modulo p. We 
recall from Chapter VII, 82, that 


(X +1)? — XP? -1=pX(X +1)(X? +X 4+1)C,(X), 
where C,(X) € Z[X], 


_ f 1 when p=-—1 (mod 6), 
aes when p= 1 (mod 6). 
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X* + X +1 does not divide C,(X), C,(X) is a symmetric monic 

polynomial, C,(—1 — X) = C,(X), hence C,(0) = C,(—1) = 1. Let 

(X41)? — XP-1 
Pp 

and q(X) = q(X) (mod p), Cp(X) = C,(X) (mod p). 


g(X) = € ZX] 


(4J) For p> 3, we have: 
(1) All the roots of G(X) (different from 0 and —1) in F,, and 
all roots of C,(X) in F, are double roots. 
(2) If a is a root of C,(X) then each element in the set 
ay df =I 
lt+a’ lta a 


1 
Ma. = fa. a Cl + a), 
a 


is also a root of C,(X). If Ma has less than six distinct el- 
ements, then M, = {1, —2,(p—1)/2 (mod p)} (in this case 
2? = 2 (mod p”)) or p = —1 (mod 6) and a? +a4+1 = 0, 
ag By. 

(3) X?+ X +1€F,[X] does not divide C,(X). 


PRooF. (1) Let a € F, be such that g(a) = 0. 
(X 4+ 1)?! — X?-!. Ifa £0, —1 then (a + 1)?7 
q (a) = 0. Since ¢’(X) = (p— 1)[(X + 1)?-? — X?- Z then 


a"(o) = (a+ 1? - aj = + - + = Nas 


This shows that a 4 0, —1 is a double root of ¢(X). 

From g(X) = X(X + 1)(X? + X + 1)°C,(X) if C,(a) = 0 then 
g(a) = 0, so taking derivatives: 0 = (a? +a+ 1)°C,,(a). We have 
a’?+a+1 4 0, otherwise since a € F, then necessarily p = 1 (mod 6) 
(Chapter I, Lemma 4.1). Hence ¢ = 2 and C,(a) = 0, since a is a 
double root of g(X). Thus C,(a) = 0, and a is necessarily a double 
root of C,(X). 

(2) Since C,(X) is a symmetric polynomial such that C,(—1 — 
X) =C,(X) and C,(0) = C,(-1) = 1, then if C,(a) = 0 then each 
B € M, is also a root of C,(X). We suppose that M, has less than 
six elements. Then one of the following cases happens: 


(i) a =1/a: then a = +1. 
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a = —(1+ a): then a = (p—1)/2 mod p. 
a=-—1/(1+ a): then a?+a+1=0. 
iv) a= —a/(1+a): then a = 0 or a= —2. 
a= —(1+a)/a: then a?+a+1=0. 


M, = {1,-2, (p — 1)/2 mod p} and from ((1+ 1)? —1?—1)/p = 0 
(mod p) we have 2? = 2 (mod p’). If a®¥+a+1=0 with ae F, 
then p = 1 (mod 6) so ¢ = 2. Since a is a double root of q(X) it 
cannot be a root of C,(X). Thus a ¢ F, and p= —1 (mod 6). 

(3) If p=1 (mod 6) then ¢e = 2, X?+ X +1 has root a € F, 
which is a double root of g(X), hence not a root of C,(X). Hence 
X? + X +1 does not divide C,(X). 

Let p = —1 (mod 6), so ¢ = 1. The roots of C,(X) appear in 
groups of six distinct roots, with the following exceptions: 

(i) the group of three double roots {1, —2, (p — 1)/2 mod p}; and 
(ii) the two roots of X*? + X +1 (which are necessarily outside 
F,). 

If C,(X) = (X?4+ X + 1)"A(X) with r > 1 and X?4 X +T not 
dividing H(X) € F,[X], then 


deg C,(X) = 2r + degh(X) = 2r (mod 6), 
in view of the grouping of roots of H(X). But if p = 6n — 1 then 
deg C,(X) = 6(n — 1), so 2r = 0 (mod 6), hence r = 0 (mod 3) and 
therefore r > 3. Thus if a? +a+1=0 then a is a triple root of 
C,(X), hence also of g(X). Therefore 
g(X) = (X41 = xP 


and 
7'(X) = (p- (K+ IP? - xP 
vanish at a: 


(a+ 1)??=a?}, (a+ 1)? 7? =a??. 


Comparing, we have 


a? = (a+ 1)(a+ 1)? ? = (a+ la? ? =a? * +0", 


hence a?~? = 0, so wa = 0, a contradiction. 
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We may now determine an upper bound for N(p): 


(4K) 
(1) If 2? £2 (mod p’) then 
p when p= 1 (mod 12), 
p when p = 5 (mod 12), 
NOE 24 
a when p= 7 (mod 12), 
—1l1 
: 5 when p = 11 (mod 12). 
(2) If 2? =2 (mod p’) then 
—3 
= when p= 1 (mod 12), 
—11 
= 5 when p = 5 (mod 12), 
N®)<) yg 
i when p= 7 (mod 12), 
—5 
> when p = 11 (mod 12). 


PROOF. We recall that the nontrivial solutions of 1+ X?+ X? = 
(mod p?) correspond to the nontrivial solutions of 1+ X” = (1+ X)? 
(mod p’), that is, to the zeros in F, distinct from 0, —1 of 


(X +1)? — xP —-1 
Pp 
(1) Ifp=12n+1 then g(X) = X(X +1)(X?4+ X 4+ 1)°C,(X), 
deg q(X) = p—1, deg C,(X) =p —7 = 12n— 6. Since every root of 
C,(X) in F, is a double root and the roots appear in groups of six, 
then 


aX) = 


(mod p) € F,[X]. 


6n — 1 
6 


— 4= 


N(p) <2+6] | =2+60 6=6n—4 


If p = 12n+5 then q(X) = X(X +1)(X?+X+1)C,(X). The 
same argument gives (noting that X? + X + 1 has no root in F,): 
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deg C,(X) = 12n, N(p) < 6n = (p—5)/2. If p = 12n+7 and 
p = 12n+ 11, we proceed similarly. 

(2) If2? =2 (mod p?) then g(X) = X(X+1)(X?4+ X4+1)*(X - 
1)?(X + 2)?(X — (p—1)/2 mod p)? A(X) where the roots of A(X) € 
F,,[X] appear in groups of six distinct double roots. 

If p = 12n+1 then ged A(X) = 12(n — 1) and N(p) < 2434+ 
6(n — 1) = 6n —1= (p— 3)/2. 

In the same way we derive the other upper bounds. 


All the preceding considerations do not yet provide any explicit 
formula for the numbers of solutions of the congruences in question. 
As we shall now see, such formulas may be obtained using Gaussian 
periods and Jacobi cyclotomic sums, as in §2. 

Let p be an odd prime, m > 1, let h be a primitive root modulo 

pt. let ¢ be a primitive root of 1 of order p™, and let p be a 
Sante root of 1 of order p™*!, p? = ¢. The Gaussian periods 
ni = ni(p,m, h) are defined as follows: 


(4.5) 
mo = pt ph + ph pee $ ph” 
- _ ot is pee + pees norte pr” 2)pmt 
Npm—1 = ge + oe as per renege pee, 


It is also convenient to agree that 7, is defined for any index j, by 
letting n; =; when j =i (mod p™),0<i<p™—-1. 

If g is another primitive root modulo p”*!, then g=h" (mod p™*') 
where gcd(r,p™(p — 1)) = 1. If n;' = (p,m, g) then n;' = 7,; (for 
i=0,1,...,p™— 1). Indeed, 


p-2 i p-2 p-2 Peer 
_ giP +i = or nie +ri - 
ni’ => pe = p = "ris 
j=0 a t=0 


because if jr = t (mod p— 1), 0 < t < p—2 then {(jrp™ + ri) 
mod p™(p —1)| 7 =0,1,...,p—2} = {(tp™ + ri) mod p™(p— 1) | 
t=0,1,...,p—2} as we may easily verify: 7 4 7’ (mod p”) if and 
only if jr 4 j’r (mod p™). Hence up to a change of numbering, the 
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Gaussian periods are independent of the choice of the primitive root 
modulo p™*?. 

Let us observe now that each period 7; is a real number. Indeed, 
n; is the sum of (p — 1)/2 pairs of complex-conjugate numbers: 


pie +4 4 pi(p-1)/2+35)p™ +4 pie +i _ pip +i pie +i 


=p +p =p pre EMR: 


And we also have the gross estimations 


p-2 
jp ™ +i 
(4.6) lal << do |e" "| <p 
j=0 


(the equality would hold only if the numbers per would be all 
multiples of one of them, which is not the case). 

For every t not a multiple of p, let ind,(t) = s, where 0 < s < 
(p —1)p™ —1 and t =h* (mod p™*’). 

The Jacobi sums 7; = 7;(p,m,h) (for 7 = 0,1,...,p™ — 1) are 
defined by 


pvtt—1 

(41 = (Op) = S citdagt 

pyre 
In particular, 
(4.8) To — 0. 
Indeed, 

prtt_4 
To = (1, p) = S- p', 
pre 


so T) is the sum of the primitive (p”*')th roots of 1. Hence 7 is 
the coefficient of the term of degree y(p™*') — 1 of the cyclotomic 
polynomial 


ae = 1 = xX (e-Vp™ + X (p—2)p™ + ooh + xe + Ts 


thus 7) = 0. 
We shall see that 7; £ 0 if and only if p /j. For this purpose we 
require the following lemma about sums of roots of unity: 
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LEMMA 4.1. (1) Let n > 1 and let € be a primitive nth root of 1. 
For every integer a: 


Se = n when n|a, 
a> (0 when n fa. 


et p be an odd prime, m > 0, and let p be a primitive roo 
2) Let p b dd pri >0 d let p b mate t 
of 1 of order p™*!. For every integer a: 


pnt p(p™*") when p™** | a, 
Sop =4 —p™ when p™ | a, pt! Ja, 
re 0 when p™ Ja. 


Proor. (1) Let d= gced(n,a), n = dn’, a= da’, so gcd(n’,a’) = 1. 
Since €¢ is a primitive root of 1 of order n then €™ is also a primitive 
root of 1 of order n’. Hence 


et => (ee Nae 


But each x, 1 <x <n, may be written as v7 =hn’+y,1l<y<n’, 
0<h<d-—j. So 


x2=1 r=1 
d-1l n 
hn'+y 
d 
=e 
h=0 y=1 
d-1 n’ 
d 
=) 
h=0 y=1 
_ f vd=n when n’ =1,ie., n| a, 
ha when n’ > i: ie., n/a. 


(2) If p™** | a then the sum 


is obviously equal to ®(p”*'). If p™*! a, p™ | a, let a = bp”, 
with pb. Then ¢ = p’” is a primitive pth root of 1. Every 2, 
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1<a<p™'! is of the form tc =hp+y,0<h<p”™-1,l<y<p; 
moreover, p x when y # p. So 


If p™ /a then 


St+ S- oS pao 


p\|x c=1 
1<e<pmtt 


by (1), since p™** Ja. 
Next we note that p? = ¢ is a primitive root of 1 of order p™. Each 
x, multiple of p, 1 < x < p™*', is written as x = py, 1 < y < p™. So 


Pp Pp 
s pe = S- cw — 0 
e=1 y=1 


by (1), since p™ /a. We conclude that S = 0. 


The following lemmas concern indices: 


LEMMA 4.2. Let p be an odd prime, m > 0, h a primitive root modulo 
p™*! andi an integer, 1 <i<p—1. Then 


ind,(1+ip”) =iap™"' (mod p™) 
for some integer a, not a multiple of p. 
PRoorF. Let s = ind, (1+ ip”), so h* =1+ip™ (mod p™*'). Then 
he? S (ap)? S14 ipsa (mod grt), 

Hence sp = 0 (mod p™(p — 1)), that is, sp = —bp™(p — 1) = bp™ — 
bp™*!, then s = bp™ 1 — bp™, so s = bp (mod p™). Since 
p) i there exists a such that b = ia (mod p), hence s = iap™? 
(mod p™). 

It remains to show that p /a. Otherwise s = 0 (mod p”™), that is, 
1+ip™ = hP"° (mod p™*") (for some integer c). By (3.3), (1+ip™) 


mod p™+! = hP”"* mod p™+! € UNV = {i}, and therefore i = 0 
(mod p), which is a contradiction. 


m—1 
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For every j = 0,1,... ,p™(p — 1) — 1 let U; = {@ € (Z/p™*")° | 
ind, (@) = j (mod p™)}. Since U = {a@” | a € (Z/p™*")*} then U; is 
a coset of (Z/p™*')* modulo U. In view of a later result concerning 
sums of squares of periods, we need the following description of these 
cosets. 

Let 1<k <m, let i=0,1,... ,p™—-* — 1, and let 


SOOM Oi ee em Cee eres 
Similarly, let 
Si = {(1+ ip) -U, [1 +44 pr-*)p] - U, [1 + (¢ + 2pm-*)p] - U, 
... {1+ (i+ (pk — 1)p™-*)p] - U}. 


First we note that for each i, the cosets Uj ipm—«, Ujgupm—« (with 
14’) are distinct. Otherwise, i+ lp’™"* =i+I'p™* (mod p”™), 
hence | = 1' (mod p*), contrary to the hypothesis. So #S; = p*. 

Ifi Ai’ then S,N Sy = 0: if it lp™* = 7 +U'p™* (mod p™) 
then necessarily i = i’ (mod p™*), soi=i. 


So oN S; has p™ cosets, that is, it consists of all the cosets 
modulo U. Similarly, the cosets 


[1+ (+ lp™*)p] -U, [1+ G+ Up™-*)p] -U 
(with 1 £ I’) are distinct. Otherwise 


1+(itlp™-*)p- (Gl + (i+ 'p™*)p)) EeVnU = {i}, 
so 
1+(it+lp™*)\p=1+(it+l'p™*)p (mod p™*?), 
hence 1 = 1' (mod p*). So #S! = p*. 
If 47 then S; NS’ =O: if 
(1+ (¢+lp™-*)p] -U = [1 + (a + Vp™-*)p] -U 
then as before 


1+(itlp™™*)p=(1+7 +l'p™*)\p (mod p™*?), 


hence necessarily i = i’ (mod p™*), so i = 7’. Thus te S; has 
p™ cosets, that is, it consists of all cosets modulo U. 
Now we prove: 


LEMMA 4.3. There exists a permutation a of {0,1,...,p™~* — 1} 
such that S; = Sz). 
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Proor. Let ind(1+ip) = g, that is, (1+ip)-U = U,. We show 
first that for every 1=0,1,...,p* —1, 


ind[1 + (¢ + lp"-*)p] = j + sp™* 


for some s, 0 < s < p* — 1; this is equivalent to showing that 


[1+ (6+ lp™-*)p] -U = Ujsepm-e 


and allows us to define m(i) = j, with S/ = S,(;. Indeed, let i’ be 
such that (1 + 7’p)(1 + ip) = 1 (mod p™*") and let 1 = I'p" with 
p/U,0<r<k. We define b, 1 < b < p*"" —1 by the congruence 
l'(1+i'p) =b (mod p*~"). Then 


[L+ (G+ lp™™*)pl[1 +p] = 14+ 11+ i'p)p™ 
=| +1V(1 +ip)pr htt 
= 1+ bp™-**"*1 (mod p™*"), 


But 
(1 xe ioe are ST a (mod hale F 
SO 


m—k+r 


1 ea bp ar = pie-lp et 


(mod p 
Taking s = t(p—1)p” then 


1+ (é+lp"*)p = (1+ ip)h?” "= hit?" (mod p™*), 


as we needed to prove. 

We show that if i 4 @’ then 1(i) 4 m(2’). If h7 = 1+ip (mod p™*"), 
hd’ =1+i%'p (mod p™*) and j = j’ (mod p™—*), let j/ = j+sp™-*. 
So 1+i'p = (1+ip)h*®" " (mod p™-*), hence h*?”" " € V. So sp™-* 
is a multiple of (p— 1), hence also of (p— 1)p™*, that is, sp™-* = 
t(p —1)p™-*. Let h?-? =1-+ ap (mod p™*?). Then h?” " = (1+ 
ap)?” * = 1+ atp™-**1 (mod p™-**1), Therefore 1+ i’p = (1+ 
ip)(1 + atp™-**1) = 1+ (i+ atp™*)p (mod p™-**?). We conclude 
that i’ =i (mod p™—*), so i’ =i. 

We conclude therefore that 7 is a permutation with the required 
property. 


We now derive some formulas for Jacobi sums and Gauss periods, 
which are analogues to the ones already proved in Chapter IX, §2. 
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LEMMA 4.4. (1) If7 =1,... ,p™—1 then T; = Tym_; (77 denotes 
the complex conjugate of T;); 
mt+1 es 
we ND when p Jj; 
(2) 973 ={ 0 when p | j; and 
(3) 7; #0 if and only if p does not divide j. 


PROoF. (1) 


pmtt 


T; = S- Cone) ge" 


a2=1 


But ind; (—) = ind; (—1)+ind,(x) (mod p™(n—1)) and ind;(—1) = 
ip™(p — 1), so Ginds(-) = 1. Hence 


m+1 
P 
a rac(=te) 22 
co oe Cpe ne *)p ° = Tym; 
ve 
(2) We have 
pmti_y prt 1 
TF} = S- CJ indn (2)p* S- C—Finds (y) p-y 
oye ayy 


pmtt_y pmtt_y 


S- S- Ci (indn (2)—indn(y)) pe—y 


x=1 
pix pia 


For every x,y as above, let t,1 < t < p™*' —1, be such that 
y = at (mod p™*'), hence p} t and ind;(y ” = ind,(x) + ind;(t) 
(mod p™(p—1)). Then 


prtt_ypmtt_y 


Tj; = ~ C —j ind; ( Op xz(1-t) 


Bie ois 
pmtt_y pmtt_y 
es j ind, (t xz(1-t) 
— 2 ae. 
e=1 
pit pfx 


Using Lemma 4.1, we have 


p-1 
‘=p oI py oe 
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where the first summand corresponds to t = 1 and the second sum- 
mation to the terms t = 1 + ip™; the other values of t give the sum 
0. By Lemma 4.2 


ind,(1+ ip”) =iap™' (mod p”) 
(where p a), so 


p-1 
ny = pot) lp) SG 
i=l 
_ { PpH=hapt ap . stp yy; 


peel) pip} ls tp |: 
(3) This is obvious from (2). 


We note the following connection between the periods and the 
Jacobi sums: 


LEMMA 4.5. Consider the matrix Z = (C7) 55-040. pm—1 and the vec- 
tors 


To 7o 

77 mM 
Ts . 5 fl — 

Tpm— 1 Tpm —1 


Then: 

(1) ZZ =p (I identity matrix), that is, 
S- CCN ep Oe 
k=1 

(fort. SU Las pe 1): 


(2) Zr = py, that is, 


peal 


De ry es 
g=0 


Gort ]0) 1p" = 1). 
(3) 7 = Zn, that is, 


p™-1 


Cony = Tj 
j=0 


(or @]0,15433 gp = 1); 
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Proor. (1) By Lemma 4.1(1), 


p™ pl i Winive . bans te : 
ik -—kj (i-j)k _ J P when p™ | i — j, that is, j = 4, 
eee as 


as = when 7 £1. 


p’-1 pr-1 ort : 
> Cir _ S- ame > CJindn(2) pe 
j= j0 a 
prt pm-1 
cating 
= ys or » ex i+inda (x)) 
r=1 j=0 


But by Lemma 4.1(1) 


m 


my : : an 
> cit-ttindy (oy) — f P when ind,(x) =i (mod p™), 
oa 0 otherwise, 

hence the sum to be evaluated is equal to 


p-2 te 
o> p= =n, 
a=0 


ind, (x)=i (mod p™) 


since ind;(x) = i (mod p™), 1 < x < p™ — 1 is equivalent to 7 = 
he" +* (mod p™*), withO<a<p-1. 

(3) From Zr = pn, by multiplication with Z we obtain pr = 
ZZrT =p” Zn hence tT = Zn. 


As a corollary we obtain the vanishing of special sums of periods: 


LEMMA 4.6. Let p be an odd prime, m > 1, let 1 <k <m,i= 
0,1,...,p™-* —1. Then 


pr-1 


S- Nitapm—-k = 0. 
x=0 
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Proor. By Lemma 4.5(2): 


pel p®—1p™-1 
m = —(it+ap™—*)j 
Pp hi+apm—* Tj 
x=0 


l| | 
as) 
Nall 
| 
ia 
I~ : 
| 
i. a) 
3 ~~ 
P as 
SS 
Mi 
a 
oT 
8 
s 
3 
7 
wS 
Ne 


where € = C?”" “. Since € is a primitive root of 1 of order p*, by 
Lemma 4.1(1), 


-1 : : 
> E774 = pe if pe | J; 
0 otherwise. 


Hence 


«2=0 


pe-1 p’-1 7 
p” (s nor) =o S- Cr = 0, 


by Lemma 4.4(3). 


For example, ifm = 1 then k = 1 and 6 n; = 9, which was 
already known. If m= 2, k = 1 then 


p-1 
(4.9) Snape for a= a pa 
j=0 


If m= 2, k = 2 then ie Na = 0, as already known. If m= 3,k = 
1 then 


p-1 


(4.10) So Nitjp? = for #=0,1,...,p’°-1. 


j=0 
If m= 3, k = 2 then 


p-1 


(4.11) pao «for 4 S01 pd 
j=0 
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and finally, ifm = 3,k =3 then ee nj; = 0, as already known. 
We may also evaluate the following sums of squares of periods: 


LEMMA 4.7. Let p be an odd prime, m> 1, letel1<k<m,i= 
0,1,...,p7-* —1. Then 


PRroor. Let U; = {@ € (Z/p™*")* | ind, (a) = j (mod p™)} for 
j =0,1,...,p”™(p—1). Then U; is a coset modulo the subgroup U. 
By definition, 

hy = Se p" . 


O<s<(p-1)p™ 
seu; 


We keep the notations preceding Lemma 4.3: 
S; = {U;, Uispm-e, Ujs2pm-r, Sea Us. (pk—1)pm—« }, 
S; = {(1+%p) -U, [1 + ipe*)p] - U, 
cay ee (ab pep) pl Us fy 


and let 7 be the permutation of {0,1,... ,p’~* — 1} such that S/ = 
S,(:) (see Lemma 4.3). Then 


pe-1 


ee = DG = DH. 
xz=0 


U;ES; U;ES,-1() 


Thus we need only to evaluate the sums 


pR-1 p-1 2 
iptap@ti—k)gp™ 
POD aes ) : 


x=0 a=1 
We have 
p®—1p-1p-1 
; m+1—-k m m 
S; = or tee (a? + bP") 

xz=0 a=1 b=1 

pote m m m— 1 pr-l 1 
=) aP  +bP +i(aP +b? ) x(aP +b? ) 
= p ¢ gE ’ 

a=1 b=1 x=0 

m+1—k m-1 


(so PY = 1), noting also that a?” = a? 


(mod p*), and similarly for b. 
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By Lemma 4.1, 


p—l oe 9, ge 
Si ae es 
a=1 


1 


“) = pp - 1), 


and this concludes the proof. © 


In particular, taking k = m we have 


pl 


(4.12) do =P = 1): 


We specialize now the above results to the case where m = 1. Let 
a,l1 <a < p—1 be defined as follows. If s = ind,(1 +p) then 
h’ = 1+ p (mod p”) hence h’? = (1 +p)? = 1 (mod p?), hence 
p(p — 1) divides sp, so p— 1 divides s. We define a by a(p—1) = s. 


LEMMA 4.8. Ifm = 1, a(p—1) = ind, (1+p) then ind; (1+cp) = —ca 
(mod p) for every c=0,1,...,p—1. 


Proor. From h*®-) = 1+p (mod p”) we have h°@®-) = (1+p)° = 
1+cp (mod p*). Hence ind;(1+ cp) = ca(p—1) (mod p(p—1)). In 
particular, ind),(1 + cp) = cap — ca = —ca (mod p). 


LEMMA 4.9. If m= 1, the Jacobi sums (fori = 0,1,...,p—1) are 
given by 
a dae 
where a(p — 1) = ind,(1+ p). 
PROOF. 


p’-1 


j= S- Cond ote 


x=0 
By (4.3) we may write in a unique way 
x =h?’(1+cp) (mod p’), 
where 0 < b,c < p—1. Then by Lemma 4.8, ind;,(x) = pb+ind,(1+ 
cp) = —ca (mod p). Therefore, 


p—l1 


p-l p-1 p-1 
T= ye Cane S- po en = S- pe S- Gants) 
c=0 b=0 c=0 


b=0 
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noting that p? = ¢). Since ia # 0 (mod p) then (za)? (mod p?) € 
: ) 


U. So there exists bo, 1 < bp < p—1 such that (ia)? = h?’> (mod p? 
By Lemma 4.1(1), 


cei = Pp when b = bo, 
= ~ | 0 when bF bo. 


We conclude that 


pb ia)? 
pro (ia)? 


Th= 


In analogy with (2B) we indicate an inductive expression for 
N(p,m;k) and F(p,m, k) in terms of the periods (see Klésgen (1970)): 


(4L) 
1 pet 
(1) F(p,m,k) = —F(p,m —1,k) + S- nk; and 
P P i=0 
1 (eae 
i=0 
Proor. (1) By Lemma 4.1(1), we have 
m+1 1 


p-1 p-l p—1p 
~ 


p™*! F(p, m, k) ye ey 


vy=l1arQ=1 Lp=1 y=0 
p™tt_y p-1 - p-1 
= py se S perk 
y=0 w=1 Le=l 
prtt_1 /p-1 x k 
= ya? 
= p 
y=0 xr=1 
pmtt_y sp-1 k prtt_1t /p-1 k 
oa ge? 
De Woe apy ae 
y=0 x=1 y=0 w=1 
ply ply 
p™—-1 /p-1 ce k pmtt_1 /p-1 k 
= taP yx? 
=i ag | Dae ey Dee | oe 
t=0 c=1 y=0 x=1 
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since p? = C and 2?” = 2?" ' (mod p™). Since 


p™-1 /p-1 toa k 
p”F(p,m —1,k) = S- (Se ’ 


t=0 xc=1 


by the same computation as above, then 
p™tt_1 /p-1 k 
p+! F(p,m,k) = pF (p,m — 1, k) + oe i 


Each y, 0 < y < p™*'—1, p J'y, may be uniquely written in the form 
y = h' (mod p™*?), with 0 < i < p™(p—1). Also, each x?” (with 
1 <x < p—1) may be written uniquely as x?” = h®” (mod p™+') 
with 0<c<p—2. Hence 


> 1 (Sp ) p™(p-1)-1 b 7 ’) 
yaPr™ p it+cp 


he 


T 
ike 


T 
ae 


1=0 
p™-1 
=(p-1l) don 
1=0 


Thus 


Pp 


(2) Dividing by p— 1 and taking mee into oe, we deduce 
the recurrence relation for N(p,m;k). 


As a corollary, it follows from (4F) that 


(4.13) F(p) = iis 
Pp De i=0 
ee pe 
(4.14) N(p) = #4 5 3o np. 
Pp P" i=0 


Let S(p, k) = 32°25 [ni(p,m)]*. From the recurrence formulas in 
(4L) we obtain the expressions: 
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(4M) 
(1) F(p.m,k) = Plt +(—1)+ 80", B) + 5(9""4, b+ 
--+ S(p,k)]. 
(2) N(p,m, k) = wee 1) pha) (o" ky ES (pes et 
--+ S(p,k)]. 
PRooF. (1) We have 
1 1 
F(p,m,k) = ae m—1,k)+ ee” ik), 


1 1 pal = 
—-F(p,m—1,k) = =F (p,m —2,k)+ S(p”~~k), 
: (p ) P ( ) 5 ( ) 


ah 1 
—— F(p,1,k) = F(p,0, k) + P_S(p, k). 
Pp p™ p® 


Adding these equalities and taking into account (4G), we deduce 
that 


F(pm,) = 2p 1h + (yf + $0", 8) + S19", B) 


+-+>+ S(p,k)). 


(2) This formula is obtained from the preceding one by dividing 
by p—-1. 


Taking k = 2 in the above formula and noting that N(p,m,2) =1 
(trivially) then (4M) yields again the relation of Lemma 4.7 for k = 
m. Indeed, if m= 1, 


1 = N(p, 1,2) = ale 1) +1+ S(p,2)], 


hence 
p’-l 


S(p,2) = So Ine, DP = p(y - 1). 
j=0 
Assuming by induction that for r < m we have 


p'-1 


S(p",2) = S¢ [ni(p, ry? = p"(p— 1), 


j=0 
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then 
1 =-N (p,m, 2) 
1 
= pone OOS hp dpa es Bp) 


Hence S(p",k) = p™(p—1). So S(p™,k) is an integer which is a 
multiple of p™. 

For the next result, which is about sums of powers of the Gaussian 
periods, we shall require a lemma about the q-adic values of products 
of factorials. 


LEMMA 4.10. Let q be a prime, s > 1, w>1 and let r1,72,... , Ty, be 
integers greater than 0 such that q& =1T1 +7r2+---+7,. Then 


Ug(Til Ta! - Ty!) < quy(q*"!), 


tid pe SG HS aoe Sh SH] then lg ge) = 
qv (a?-). 


ProorF. If s = 1 the statement is trivial (since 4 > 1). We proceed 
by induction on s. 

We have [ri/q] + ---+[r./q| < ¢*', so for some integer ro’ > 0 
we have rj + [ri/q] +---+ [r./q] = |. By induction, 


cy tn( Ef) 4 +0([E}) < one. 


By Chapter II, (1A), if ¢ > 1 is an integer then 


wo-[Jelal+fel> 


and if x > 0 is any real number, a > 1 any integer, then [x/a] = 


[[2]/a], so 
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EECA tel) 


=i =i q vn 
< gh) +u,(r w+ dw, ([2 |!) 
ag* + quy(qr- at) 
EO Tae oe ee) 
= q(qe~? sage. pl here +qt+ 1) — qu,(q*'!). 


The last assertion is of course trivial. 


Now we may derive the following properties of the sums S(p™, k): 


(AN) 
(1) S(p™, k) = p'™ |pN (p,m, k)—N(p,m-1, k)| = p™ |F (p,m, k)— 
F*(p,m, k)]. 
(2) Ifq is a prime, k= q°, then S(p™,q*) is a multiple of q. 


PRroor. (1) By (3L) and (4H), 


S(p™,k) = p™ |pN (p,m, k) = N(p,m _ 1,k)| 
= P' \(p = 1)N(p,m, k) F"(p,m, k)] 
== p"|F(p,m, k) — F*(p,m, k)). 


(2) By virtue of (1) we may assume g ¥ p and it suffices to show 
that q divides F(p,m, q*°) and F*(p,m,q*) = F(p,m,q*;c), where 
l<c<p-le=q aiele) (by (4H)). Let 21,20,...,2, € Z 
be : such that ae + oe +-++++ a? =0 (mod p™*'), respectively, 
ge gk shes kg Se on p”*'), where 1 < x, < p™*?—-1 
fori = 1,... a ke . Each k-tuple obtained by a permutation 
from (2 ,%2,... , Xx) is still a solution of the congruence. So the set 
of solutions is organized into disjoint classes of solutions equivalent 
under permutation. Therefore it suffices to show that the number of 
solutions equivalent to any given one is a multiple of q. 


Let the k-tuple (x1, 22,...,2,) have exactly jy distinct compo- 
nents, repeated, respectively, r1,7r2,...,7, times (with r; > 1). We 
note that pb # 1; otherwise from x7; = %2 = --- = X_ we deduce 


qxi = 0, recwectinelcd (mod p), which is impossible. It follows 
thiak eh oti ek 
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The number of solutions obtained by permutation from (2, X2, 
.,£,) is equal to k!/(ri!ro!--+r,!). So we need to prove that q¢ 
divides this number. We have 


ug(k!) = ge +q? 7 +--+ q41. 


Since r1 + ro +--+ +7, = @ and pw > 1 then v,(r1! re! ---r,!) 

AC la a lie ere a a cae a le ca 
Therefore the q-adic valuation of k!/(r!r2!---1r,!) is at least equal 

to 1, which was to be proved. 


IA 


We now indicate an upper bound for N(p) in terms of the periods. 
In view of (4.13) we are led to find an upper bound for the sum 
y-?-) 13. For this purpose we establish the following lemma: 


LEMMA 4.11. Let n > 3, let f be the function of n real variables 


n 


Tice: 1 Yn) = Soy? 
i=1 
defined on the set D of all points (y1,... , Yn) such that 7"_, y; =0 
and Yi ¥; = n(n — 1). 
(1) If (y1,--- Yn) 1s a@ point where the function assumes a mazi- 
mum or a minimum, then there exists an integer T,1<T < 
n—1, such that (up to a permutation of {1,...,n}) 


n—-1 
Y1 UT (n ) Ta—P): 
n—-1 


Let y" be the point with the above coordinates. 

(2) fy") = nln — Yin = 27) V(n—D/(Tn=T)), f(y"-") = 
—f(y"), f(y7) > 0 for 1<T < (n—-1)/2. 

(3) If T =1 then f(y?) = n(n —1)(n — 2) is the absolute mazi- 
mum of f on the given domain D. 

(4) If1 < T <n-—1 then f(y") is the absolute maximum of 
f on the points y = (y1,--- Yn) € D such that y; < (n- 


T)V(n-1)/(2(n- 7). 
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Proor. (1) To find the points of maximum or minimum of the 
function F’ on the domain D, we employ the method of Lagrange 
multipliers. Let A, ~ be parameters to be determined and 


F(yt,-++ Ym) = F(Yis--- wl +a(Son) +n (Sout nev). 


w=1 w=1 
If (y1,--- Yn) is an extreme point then (OF/Oy1)(y1,---,Yn) = 0 
(fori =1,...,n), that is, 
(4.15) 3y7 +At+2uy,=0 (fori=1,...,n). 
Adding up these relations, we obtain 3n(n — 1) + n\ = 0, hence 


A\ = —3(n—1). Substituting this value in the above condition (4.15), 
we have 3y? + 2uy; — 3(n — 1) = 0, hence 


—px Ve? + O(n = 1) 


4.16 = 
(4.16) y 5 
Since there are only two possible values for the coordinates then, up 
to a permutation of {1,... ,n}, there exists an integer T, 0 <T <n, 
such that 
—pt+ Vp? + 9(n—- 1) 
Wa =yr = 
3 
—p—-Vf/pP+9n-1 
YT+1 = °° = Un = ( ) 


3 


We note that not all coordinates can be equal since )>;_, y; = 0. 
Hence 1 < T < n— 1. Moreover, 


m 
0=—-3(T+n—-T)+5yw+9n—-1(T—-n+T7), 
hence 
np = (2T — n) [2 + 9(n — 1); 
therefore 
ny? = QT — np? +9(n—V)] 
and finally 
_ 3(2T — n) n—1 
2 T(n—-T)’ 
— 1)n? 
Peon ja )n 


— AT(n-T) 
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Substituting into (4.16) we obtain 


n—1 
(n-T) Tin Ty’ or 
(4.17) Yi = al 
ae T(n—T) 


(2) For the point y’ with above coordinates, we have 


i —1 a 
i a - DVT - 
i=1 LO 
— a 
(n DVT 
= 
n—1l 


= n(n 


Let 


F(t) = n(n 2p 


for 0 <t<n._Then f(n—t) =—f(t); if 0 <t < n/2 then f(t) > 0 
and f(y?) = f(T) for T=1,2,...,n—1. If0<t< (n—1)/2 then 
t(n —t) < ((+1)(n—t-—1), as seen at once. Hence 


n — 2t S n— 2t—2 
Vt(n—t)” V(t+1)(n-t-1) 
and therefore f(y’) > f(y"**) > 0 for T = 1,2,... , [n/2] — 1. 


(3) For T = 1 we have f(y!) = n(n—1)(n—2). Since the function 
is continuous and defined on a closed and bounded domain, it has a 
maximum and a minimum. By (1) and (2) it follows that f has the 
absolute maximum at y’. 

(4) Let 1 < T < n—1 and let Dr consist of those points of 
D with coordinates y; Seve e yi < (n-T)V(n—-1)/(T(n—-T)). 
Ifl< T <n-1 wen y’ € Dr exactly when T < rT”. Hence 
from f(y ~!) > f(y") > 0 for 2 < T’ < [n/2] and f(y?) < 0 for 
T’ > [n/2], we conclude that f(y") is the absolute maximum of f 
on the domain Dr. 
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With the above notations we have: 


(40) Let M = max{n; | i = 0,...,p—1}, let T be the largest 


integer such that 
| p-l 
M< T t 
p-—l 
N(p) < 1+ (p-2T) T(p—T) 


Proor. Since 7?-5 7; = 0 then 0 < M. By (4.6), M <p—1. The 
1 
t) 


function 


is decreasing, f(1) = p—1, f(p—1) = oe ne there exists the largest 
integer T,1<T < a —1, ich 


M < (p—T) Frees} 


p-1 p-l 
Yim=0, Sin? =p(p- 
1=0 1=0 


as seen before, then (7, 1,--- ;Mp-1) € Dr. By Lemma 4.11, f(yf) 
is the absolute maximum of f on Dr and we have 


Then 


Since 


p-l a | 


ye =f (nor. i%p-1) S fly") =p -— 1) @— 27) Tay 


By (4.13), 
IG, es 
N(p) = ——+-4 i 
(p) 3 po 
LPH? Nip=2P) jf) pad 
p p Pip?) 
p-—l 
<1l+(p-2T 
( ) ToT) 
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We note that if 


p-l 
Lipa): 


M = (p—T) 


then N(p) < 1+ M. Indeed, in the above proof we obtained 


NG == Eee pad 


p p T(p—T) 
ae) 2 pe OP 

50 polp u 
p Do Paw, 

<1+4+M. 


Now we study the asymptotic behavior of F'(p,m,k), N(p,m,k) 
when k tends to infinity. 


(4P) 
‘ N(p,m,k) | 
es (p— 1) T/pmet 
PRooF. From (4M) we have 
N(p,m, k) s (—1)' = S(p’, k) 
(ppt taper to) X (p=) 


3 


1+ Gye Y vey (He Dy 


j=l i=1 


Since |n:(p,7)| <p —1, by (4.6) it follows that 


Jin {MOTD =o 


hence limyz_,.. N(p,m, k)/((p — 1)*"1/p™t?) = 1. 


We consider the existence of p-adic solutions of a certain congru- 
ence. Let U, denote the multiplicative group of (p — 1)th roots of 1 
in the ring Z, of p-adic integers. We observe: 
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(4Q) 
(1) There exists an integer my = Mo(k,p) > 1 such that for 
every m > mo we have N(p,m,k) = N(p,mo,k). Let N(p, k) 
denote this number; N(p,k) > 0. 
(2) N(p,k) is the number of solutions of the equation 1+ X_+ 
--» 4+ X;, =0 by elements in U,. 


PROoF. (1) We have seen in (4H) that 0 < N(p,m,k) < N(p,m— 
1,k). Hence there exists mp with the property indicated. 
(2) Let m > mp and let 


k 
1+ Se =0 (mod p™t'), 

1=2 
where p /'x;. Since oe = (mod p™*') then 


1+ Se ae =0 (mod p™*?), 


SO 
k 


1+ S- xpt!=ap™t' (mod p™*?) 
1=2 
for some a,0 <a < p—1. Since m+1 > mo, by (4H), F*(p,m + 
1,k) = 0; therefore necessarily a = 0, so 


1+>° oe" =0 (mod p™*). 


In this way we have established the mapping 


(22 mod p™*",... , 2, mod p 


— (x2 mod p™*?,... , 2, mod p 


a 


Uere) 


from the set S,,, of solutions of 
LA 4X?” =O" (ined p™*) 


to the corresponding set S,,,1. This is clearly an injective mapping. 

Starting therefore with any (v2 mod p™*+1,... , a, mod p™*?!) € 
Sino We obtain the sequences (Qm,i)m>mo, Where An; = a” (for 
i = 2,...,k). Since ae = a (mod p™*') then Qm+414 = Omi 
(mod p™), so the sequence (Qm,i;)m converges p-adically. 
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Let 04. = lim, seo One hen 1 og-6 =F a, = Oin Des: Since 
Pp’) = 1 (mod p™) for m > mp then a? * = 1, so aw € Uz 


Oni a 
and (aQ2,...,Q@z) € S, the set of solutions in U, of the equation 
1+ X2+---+X; =0. So we have an injective mapping o from S,,,, 
into S. 

On the other hand, if a; € Us C= Cis yas: (okt = Qs 5 -R) 
and 1+a2+-+-+a, =0, since a?-' = 1 then y*} = 1 (mod p™*) 


(for i = 2,...,k and all m sufficiently large; we may take m > mv). 
Pp (mod pres) 


IS oe ene and 


1+ ae Se ae =0(0 (mod ge) 
In particular, 
+a, to +2h, =0 (mod p%*), 


SO (%m.2,+-+;Lm.k) € Smo. It is now immediate that the mapping o 
associates with this solution in S,,,, the given solution in S, thus ¢ 


is surjective. We conclude that the number of elements in S' is equal 
to N(p,k) = #9 ng 


As a corollary, we have: 


(4R) If p = 1 (mod k) then there exist (p — 1)th roots of 1, 
Q2,...,Q% € Up, such that 1+ag+---+a, =0. 


Proor. By (4A), for every m > 1 the congruence 1 + Daan cree 6 
XP?” =0 (mod p”*!) has a nontrivial solution. In particular, N(p, k) 
> 1, hence by (4Q), there exist az,... , az € U; such that 1+ a2 + 
+a, =0. O 


We reproduce now the following tables, computed by Klésgen, 
which give N(p,m,k) for low values of the argument. Table for 
N(p,1,k): 
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3.4 #5 6 7 
5}0 9 O - 100 39 
7/2 15 60 340 1680 
11}0 31 24 1600 5250 
13) 2 33 200 2260 21630 
17|0 57 140 6220 50120 
19|2 51 390 6880 101430 


Table for N(p,2,k) (italicized are the values which are not the 
same as in the table above): 


3.4 = °«5 6 7 
5/0 9 O - 100 0 
7|2 15 60 340 1680 
11)}0 27 24 1090 2520 
13)2 33 150 1930 15540 
17}0 45 O 3160 945 
19)2 51 300 4600 44520 


To conclude this section, we shall discuss a heuristic method to 
indicate the probability for the congruence 


1+Y?+Z"=0 (mod p’) 


to have a given number of equivalent classes of nontrivial solutions. 
We exclude also the cyclic solutions from these considerations. 
Let 1 <a < p—2 and let 


1 
M, = {a mod p, — mod p, —(1 + a) mod p, — mod p, 
a 


1 
(1+ a) 


—~ mod ph. 


M, consists of six distinct elements (all different from 0,—1), except 
in the following cases: 
(a) a = 1,p—2or (p—1)/2; then M, = {1 mod p,(p — 2) 
mod p,(p—1)/2 mod p}; and 
(b) a4 1, a? =1 (mod p); then M = {a mod p,a? mod p}. 
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This latter case happens if and only if p=1 (mod 6). 

Thus, we have a partition of {1,2,... ,p — 2} into disjoint classes. 
If p = 6n +1, the number of such classes M, (with more than 
two elements) is equal to n. Indeed, if p = 6n+ 1, this number 
is 1+ (p—2—2-3)/6 = n. If p = 6n —1 then the number is 
again 1 + (p —2—3)/6 =n. In each class M, (with more than two 
elements) let @ be the smallest integer, 1 < @ < p— 2, such that 
ae M,. Clearly @ < (p—1)/2. 

If 1+y?+2? =0 (mod p?) then necessarily z = —(1+y) (mod p). 
With this solution we associate 7, and we note that (1+y)?—-1-—9? = 
0 (mod p?). If 1+ y’ + 2” = 0 (mod p’), this is an equivalent 
solution to the above one, if and only if y’ mod p € M,, that is, 
y =y. 

By Fermat’s little theorem, if t > 1 then (1+ 1t)? -1-—t? =0 
(mod p) hence (1 +t)? —1—t? = t(p)p (mod p”) where 0 < t(p) < 
p—1. So the solutions of the congruence correspond to the integers 
t, 1 <t< (p—1)/2 such that t(p) =0. 

We consider the sequence (t(p))1<(p—1)/2- 


(4S) Assuming that the sequences (t(p))t<(p—1y/2 are random, for 
all primes p= 6n +1 we have: 
(1) the probability that 1+ Y + Z? =0 (mod p) have only the 
trivial or cyclic solution is equal to ((p—1)/p)”. 
(2) The probability that the above congruence have r (nontrivial, 
noncyclic) equivalence classes of solutions is equal to 


1 fn (? — *) 
p'\r Pp , 
(3) The density of primes for which there are exactly r (nontrivial, 
noncyclic) equivalence classes of solutions is equal to 


1 1 
rl 6" ve 


Proor. (1) As already indicated, there are n equivalence classes in 
{1,2,...,p— 2} consisting of at least three elements. Let us denote 
them by M:z,, Mz, sao , Mz... 

M;, consists of solutions of (1 +t)? = 1 + ¢? (mod p’) exactly 
when %;(p) = 0. Since the sequence (%;(p))z,<(p—1)/2 is random, the 
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probability is 1/p. Hence, the probability that none of the n classes 
M;,(p) consist of solutions is (1 — 1/p)”. 

(2) In the same way the probability that r among the n classes 
M;, consist of solutions is 


(3) The density in question is equal to 


S 

l| 

Te 

ab 
SIe 
a 
33 
ST 
——~ 
cs} 

S|] | 
—_ 
Sey 
3 
1 


| 


If p=6n+1 then 


i! 6n \or/6 di YE 
DS == fer (- = 
Grr! 190 (= as i) Grrl We 


Similarly, if p = 6n — 1 then 


PB 1 ‘ =e : (1 1 ‘ 
6Tr! n>00 \6n — 1 6n — 1 


1 1 
6'r! We 


Klésgen computed the solutions of the congruence 1+Y?+Z? =0 
(mod p?) for all primes p < 20000. 

If 0 < r let v;* (respectively, v~) be the number of primes p < 
20000, p = 1 (mod 6) (respectively, p = —1 (mod 6)) for which the 
above congruence has exactly r (nontrivial, noncyclic) equivalence 
classes of solutions. 

There are 1124 primes p such that p= 1 (mod 6) and p < 20000. 

Klosgen found that 


vg =970, vf /1124 = 86.30%, probability 84.35%. 
o. = AA: vy /1124 = 12.81%, probability 14.11%. 
Us = 93 vy /1124 = 0.80%, probability 1.18%. 
ve v3 /1124 = 0.09%, — probability 0.07%. 
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In a similar way for p = —1 (mod 6), p < 20000: 


ve = 957, vp /1136 = 84.24%, probability 84.35%. 
v; = 166, v, /1136 = 14.61%, probability 14.11%. 
05. = 13; vy /1136 = 1.15%, probability 1.18%. 
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XI 
Epilogue 


This book about Fermat’s last theorem was written for the enjoy- 
ment of amateurs. Most of the proofs are given in full detail and use 
only elementary and easily understandable methods. For this reason, 
it was imperative to exclude developments depending on the study 
of ideals of number fields or on more sophisticated theories. How- 
ever, in this final part we indicate the more important achievements 
which could not be dealt with using elementary methods. We also 
give a succinct description of the approach to the proof of Fermat’s 
last theorem. To help the reader who wants to know more about 
these matters, a bibliography of important articles is also included. 


X11. Attempts 


In this section, we give a brief overview of various approaches to the 
proof of Fermat’s last theorem. They were not quite successful but 
should not be dismissed. At their time, these results raised hopes for 
the proof of Fermat’s last theorem and led to new research problems 
of independent interest. 
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A. The Theorem of Kummer. 
In 1847, Kummer proved the following important theorem: 


Ifp > 2 is a regular prime, then Fermat’s last theorem is true for 
the exponent p. 


The concept of a regular prime needs explanation. It may be 
defined in terms of the class number of cyclotomic fields or by means 
of Bernoulli numbers. 

The Bernoulli numbers Bo, By, Bo,... are defined recursively: 


Bo=1 


and for n > 1, 


1 1 1 


Thus B, = 3, By = = B; =0,.... It is easily seen that Bop41 = 
0 for all k > 1. The prime number p is regular if p does not divide the 
numerators of the Bernoulli numbers Bo, By,... ,B, 5, B,—3. Let p 
be an odd prime, let 


¢p = cos(27/p) + isin(27/p) 


be a primitive pth root of 1. Let Q(G,) be the pth cyclotomic field; 
it consists of all complex numbers of the form 


To + TiGs ee ipa. 


with r9,71,.-. ,?p-2 € Q. The class number h, of Q(G,) is a certain 
positive integer attached to Q(¢,); it is the number of classes of 
ideals of Q(¢,), but we shall not explain these concepts any further 
(see any book on the theory of algebraic numbers, like the one by 
Borevich and Shafarevich (1966), or even this author’s own book 
(1999)). Kummer showed that the prime p is regular if and only if 
p does not divide hy. 

The smallest irregular prime is 37. It is known that there are in- 
finitely many irregular primes. On the other hand, it is conjectured, 
but it has never been proved, that there are infinitely many regular 
primes. 

The method of Kummer could be extended to deal also with many 
irregular primes. However, with these methods it was never possible 
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to establish that Fermat’s last theorem is true for infinitely many 
prime exponents. 

To determine if a prime p is regular is not a simple matter as 
soon as 7 is large, because the numerators of the Bernoulli numbers 
become very large. Noting that what is required is to ascertain 
that the exponent p does not divide the numerators of Bo,... ,B,—3 
(rather than calculating their numerators), Lehmer, Lehmer , and 
Vandiver gave a criterion which was possible to implement for actual 
calculations. In this way it was shown (at a time when the proof of 
FLT was not yet discovered) that FLT is true for all prime exponents 
up to 4 x 10° (see Buhler et al. (1993)). 
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B. The Theorem of Wieferich. 


In 1909, Wieferich proved: 


If the first case of FLT is false for the exponent p then 
2?-+ = 1 mod p’. 


This is a criterion involving only the exponent p and none of the 
hypothetical nonzero solutions x,y,z of X? + Y? = Z?. It was 
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immediately noted that no very small prime p satisfies the above 
congruence. Before the age of computers Meissner proved in 1913 
that p = 1093 is the smallest prime with the above-mentioned prop- 
erty. It is difficult to imagine the amount of calculations which was 
required. A further example, p = 3511, was found by Beeger in 
1921. Further computations by Lehmer, Keller, Clark and lately by 
Crandall, Dilcher, and Pomerance have shown that no other prime 
p<4x 10” satisfies the congruence. 

Other criteria of a similar kind were discovered by Mirimanoff, 
Vandiver, Frobenius, Pollaczek, Rosser, and Granville and Monagan, 
namely, 


If the first case of FLT is false for the exponent p then 
pP-) = 1 mod p? 


for all primes 1 < 89. 


A clever combinatorial combination of these criteria, by Gunder- 
son and Coppersmith, followed by extensive calculations (Granville 
and Monagan, Tanner and Wagstaff), allowed us to show that the 
first case of FLT is true for every exponent p < 6.93 x 101”. All this 
was done before the discovery of the proof of FLT for all exponents. 
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C. The First Case of Fermat’s Last Theorem for Infinitely 
Many Prime Exponents. 


Using methods from sieve theory, Adleman, Heath-Brown, and Fou- 
vry proved in 1985: 


There exists an infinite set S of prime numbers, such that the first 
case of Fermat’s last theorem is true for every exponent p€ S. 


A stronger result, valid not only for the first case, could not be 
established with the same methods. This theorem represented an 
important advance at that time. The method of proof was inspired 
from the old ideas of Sophie Germain and was connected with the es- 
timation of the size of the smallest prime in arithmetic progressions; 
the use of refined sieve theory was essential. 

The infinite set S, guaranteed by the theorem, is not effectively 
defined, so it is not possible, with the method of the proof, to deduce 
for any given p that the first case of FLT holds for p. 

Once again this substantial theorem is obsolete, due to the proof 
of FLT for all exponents n > 2. 
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D. The Theorem of Faltings. 


Mordell observed and conjectured that irreducible curves defined by 
homogeneous polynomials of high degree in three variables with ra- 
tional coefficients should have only finitely many rational points, 
when they have few singularities, all of lower order. The exact con- 
jecture is expressed in terms of the genus of the curve, a concept 
which will not be explained here. In a remarkable paper Faltings 
proved, among many other theorems, Mordell’s conjecture. In the 
particular case of Fermat’s equation his result becomes: 
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For every n > 3, there exist at most finitely many triples (x, y, z) 
where x,y,z are integers, not all equal to 0 and such that gcd(z, y, z) 
=1anda2”+y" =2". 


Despite its importance, this result could not lead to the proof of 
FLT. However it was used, independently by Granville and Heath- 
Brown, to deduce that the set of exponents n > 3 for which FLT is 
true has density one. This method of Granville or Heath-Brown is 
also applicable to a very wide class of exponential diophantine equa- 
tions (see Ribenboim, 1993), the conclusion being the zero density 
for the exponents for which the equations have nontrivial solution. 
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E. The (abc) Conjecture. 


The (abc) conjecture, attributed to Masser and Oesterlé, was in- 
spired by a result about polynomials, due to Mason. The conjecture 
is stated as follows: 
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For any € > 0 there exists a number C(e€) > 0 such that if a,b,c 
are integers, 1<a<b<c, withc=a+b and gcd(a,b,c) = 1 then 
c<C(e)r'**, where r is the product of the distinct prime factors of 
abc. 


Intuitively, as an example, if a = 2, b = 3” (with m,n large) 
then c= a+) is large, so the conjecture states that c must have a 
large prime factor or a large number of prime factors, so that r is 
large. 

It is easy to show that the (abc) conjecture implies: 


FLT is true for all sufficiently large exponents. 


Indeed, let n > 3 and assume that x,y,z are positive integers, 
such that ged(x,y,z) = 1 and a” + y” = 2". Let € = 3, so by the 
(abe) conjecture 2” < C(4)r*/? where 


r= | v= [pe ayz<2': 


plurynz” p|zyz 


so 2" < C(4)z°”?. This shows that there exists no such that n < no, 
in other words, FLT is true for every exponent n > no, or in short, 
FLT is asymptotically true. 

The (abc) conjecture is known to imply many other statements in 
number theory which have never been proved, as well as Mordell’s 
conjecture which was proved by Faltings. The proof of the (abc) 
conjecture should be very difficult and this is presently the object of 
intense research. 
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X1.2. Victory, or the Second Death of Fermat 


Mathematicians have the obligation of solving problems. When a 
long-sought proof, like the one for Fermat’s last theorem, is finally 
discovered, it is the moment of crying VICTORY. 

On June 23, 1993, in the third of his lectures at the Newton Insti- 
tute in Cambridge, England, Wiles announced the proof of Fermat’s 
last theorem. His manuscript, scrutinized by various experts, re- 
vealed flaws which needed corrections. Undeterred and with the help 
of Taylor, Wiles found a way out of the difficulties and in October 
1994, he made public two manuscripts, one co-authored by Taylor. 
They contain the proof of the conjecture of Shimura-Taniyama, for 
the case of semistable elliptic curves. According to the previous work 
of Ribet this entails that FLT is true. For most mathematicians this 
represents the end of the saga. Wiles deserves the admiration of all 
mathematicians for his achievement. The method used has already 
been applied to other diophantine equations. Wiles’ work was the 
final step in a new strategy which will be evoked shortly. 

There are some mathematicians who are not satisfied with the 
method of proof using elliptic curves and modular forms, considered 
— perhaps wrongly? or rightly? — to be extraneous to the problem. 
It is a legitimate task to try to find another, simpler, proof of FLT. 
But the solution of Fermat’s problem also harbors a negative aspect 
and a tear of regret is unavoidable, because mathematicians also 
like unsolved problems to stimulate their research, just like night 
butterflies are attracted by intense sources of light. The study of 
Fermat’s theorem led to the creation of the theory of algebraic num- 
bers, in the same way as the study of quadratic fields was prompted 
by Gauss’ theory of quadratic forms. The branch of mathematics 
which is the confluence of number theory and algebraic geometry, 
called Arithmetic Algebraic Geometry, developed not only by its in- 
ternal problems, but also in view of solving Fermat’s last theorem. 
The attempts to prove Fermat’s theorem, the old and the new, show 
a myriad of interesting ideas in many directions of number theory, 
by illustrious names. Will this stimulation disappear now that FLT 
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is proved? Not at all. Variants of the problem, generalizations to 
higher dimensions, will continue tantalizing mathematicians. So, 
we celebrate this striking victory and admire our colleagues who, 
through effort and ingenuity, succeeded in solving the problem. 
The proof of Fermat’s last theorem must be indirect. We assume 
that there exists n > 3 and positive integers a, b,c such that a”+b" = 
c”. The aim is to deduce a statement which is known to be false. 
No contradictions were found with statements in elementary number 
theory, nor with statements about number fields, nor for that matter, 
for any other statements until the expression of FLT in terms of 
elliptic curves. The proof of FLT was established with the following 
steps: 
(1) To associate an elliptic curve to a hypothetical nontrivial 
solution of Fermat’s equation, with arbitrary exponent n > 5. 
(II) To obtain a contradiction to the assumption of validity of a 
certain conjecture about elliptic curves and modular forms. 
(III) To prove the validity of the conjecture. 


These steps require sophisticated concepts and theories, far be- 
yond the level of this book and the knowledge usually expected from 
amateurs — and also from professional mathematicians working in 
other disciplines. My task is difficult if not hopeless. What will 
follow is simple-minded and superficial, but still mysterious and per- 
haps out of grasp for anyone who is not yet familiar with the concepts 
involved. The key notions needed are elliptic curves, modular forms, 
and Galois representations. 


A. The Frey Curves. 


For relatively prime positive integers A, B and A divisible by 16, 
Frey considered the elliptic curve of equation 


(2.1) Y? = X(X — A)(X +B) 


(see Chapter VIII, §1, (A6)) and studied its properties. 

If Fermat’s last theorem is false for the prime exponent q > 5, 
let a,b,c be positive pairwise relatively prime integers, with a even, 
such that a? + 6? = ct. Let A = a’, B = b%. The associated 
Frey curve displayed properties in sharp contrast with those of other 
elliptic curves. Frey became convinced that such a situation was not 
possible and envisioned a method to derive a contradiction with the, 
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by then well-known, conjecture of Shimura-Taniyama (see below). 
But there were serious obstacles to overcome, which would require 
many years of work (see below). 

Here are some propeties of the Frey curves. The minimal discrim- 
inant of the Frey curve is 


— a4h74(at + b1)?  (abc)?4 


28 28 
Since A # 0, the curve is nonsingular, so it is an elliptic curve. 
For every prime p not dividing A, we consider the congruence 


(2.2) Y? = X(X — a’)(X +b‘) (mod p). 


It defines a curve in the two-dimensional space over the finite field F,,. 
Since p does not divide A, the curve is nonsingular, so it is an elliptic 
curve. On the other hand, if p divides A, the curve is singular. The 
type of singularities is encoded in the invariant called the conductor. 
The primes p dividing the conductor are exactly those dividing the 
discriminant, that is, the primes p for which the curve in F,, x F, has 
singularity. The exponent of p indicates the type of singularity. In 
the present case, where the singularities are nodes, the conductor NV 
is square-free, so it is equal to 


N=|[p. 


pi|A 


Elliptic curves with square-free conductor are said to be semistable. 
Thus, Frey curves are semistable. As it was known, if Fermat’s last 
theorem is assumed false for the prime exponent gq, then q has to be 
very large; moreover, since Fermat’s equation is homogeneous, the 
discriminant is a power — and this seemed unlikely to be possible. 
We shall count the number of points of Frey’s curve modulo p (for 
every p not dividing A). To this count we add 1, which corresponds 
to the point at infinity in the associated projective curve. Let Vv, 
be the number of points and let a, = p+ 1— vv, (a, need not be 
positive). We pause to recall that in Chapter I, §1, we studied the 
Pythagorean equation X?+Y? = 1 modulo all odd primes; we proved 
that the numbers a, defined there are easily determined by a simple 
congruence for the prime p. Similar considerations are important for 
all elliptic curves (not only for Frey curves). The discriminant, the 
conductor, and the integers a, (for p not dividing the discriminant) 
are defined and studied in the same spirit. Elliptic curves which 
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can be given by an equation with coefficients in Q are said to be 
defined over Q. The rule for determination of the integers a, involves 
modular forms. 


B. Modular Forms and the Conjecture of Shimura-Tani- 
yama. 


Let N > 1 be an integer. Let ['o(V) be the set of all 2 x 2 ma- 


trices 
a b 
ec da)’ 


where a,b, c,d are integers, N divides c and ad— bc = 1. To(N) isa 
multiplicative group called the congruence group of level N. Let H 
denote the upper half-plane, that is, H = {z=x+iyeEC|y> 0}. 
To(NV) acts on A as follows: 


a b az +b 
(2.3) & t= eg 


for all matrices of [o(N) and z € H. Associated to the group 
To(N) there are finitely many special points (which we do not define 
here), called cusps; these are the point at infinity of the half-line 
{iy | y > 0} and other points in H U Q (when N > 1). 

A modular form of level N (and weight 2 — the only ones we wish 
to consider) is a map f from H* = H U {cusps of I'9(N)} to C such 
that: 


(i) for all . D €To(N) and z € H*: 
(2.4) f (S48) = e+ 0770: 


(ii) f is holomorphic at every point of H* (this requires an ap- 
propriate definition at the cusps). 

A modular form which vanishes at all cusps is called a cusp form. 

The theory of modular forms is very rich. Here are some relevant 
facts (for which we give no hint of proof): 


(1) The set M2(NV) of modular forms of level N and weight 2 is a 
finite-dimensional vector space over C and the subset of cusp forms is 
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a subspace. For the level N = 2 the subspace of cusp forms consists 
only of the form 0. 

(2) There is a natural inner product on M.(N), so it is possible 
to consider orthogonality in M.(N). 

(3) Let N > 1. If M divides N then M2(M) C M2(N). There 
is also the embedding from M.2(M) into M2(N) given as follows: 
if f € M2(N) let f(z) = f((N/M)z) for every z € H*; then f € 
M2(N). 

(4) A form f € M.(JN) is called an old form if f is in the subspace 
of M.(N) generated by the images of the mappings considered in 
(3), for all M dividing N. A form f € M2(JN) is called a new form 
if it is in the subspace which is orthogonal to the subspace of old 
forms. 


(5) Since 
i : ) €To(N) 


then f(z +1) = f(z) for each modular form and every z. Thus f 
has a Fourier expansion, which is of the form 


(2.5) {Qh = Soe 


For cusp forms, co = 0. 

(6) Hecke defined for each n > 1 coprime to the level N, a linear 
operator T,, of M2(N). The Hecke operators commute: T,, 0 T;, = 
T,, © T;, for all m,n coprime to the level. A modular form which is 
an eigenvalue for all Hecke operators 7), is called an eigenform. 

Other operators associated to the integers n, not coprime with 
N, have also been introduced and, together with the above Hecke 
operators T,,, they generate a larger Hecke algebra, whose properties 
are of essential importance (see Wiles and Taylor, and also Lenstra). 
New forms of level N, which are eigenforms for each T,, (with n 
coprime to NV) are also eigenforms of the operators of the larger 
Hecke algebra. 

Now we discuss the relationship between elliptic curves and mod- 
ular forms. For a given elliptic curve, the numbers a, (for all primes 
p not dividing the discriminant) contain very important “local” in- 
formation about the curve (for each p). It is crucial to relate these 
local data by means of some “global” invariant. 
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This important idea is a sophisticated transfiguration of the fact 
that every natural number is the product of powers of primes in a 
unique way. Thus Euler already introduced this relation between an 
infinite product extended over all primes and infinite Dirichlet series, 
summed over all integers: 


ne-3)'-£2 


P p 


First s was restricted to be a real number s > 1, for which both sides 
converge and are equal. Riemann had the idea, courageous and deep, 
of allowing s to be any complex number with Re(s) > 1. The above 
series is the Riemann zeta function. To prove the existence of in- 
finitely many primes in arithmetic progressions, Dirichlet considered 
“twisted” L-series, where the numerators are no longer 1, but val- 
ues of characters of appropriate finite Abelian groups; each series 
has also an abscissa of convergence and admits an Euler product, 
reflecting also the multiplicative property of characters. 

As for the Riemann zeta function, [-series of characters have only 
poles but no essential singularities at the boundary of the domain 
of convergence. Riemann proved that the functions definded above 
could be extended to the whole plane by analytic continuation, and 
even more remarkably, the values to the right and left of the bound- 
ary line are linked by a functional equation involving the gamma 
function. A great discovery and the royal road for analytical meth- 
ods to enter into number theory. 

In great analogy with number fields, elliptic curves also display 
very important analytical properties of the same kind. The local 
numbers a,, defined above (not forgetting finitely many factors at- 
tached to the primes dividing the discriminant), combine together 
multiplicatively to define numbers a,, (for every n > 1), thus leading 
to a Dirichlet series, called the L-series of the elliptic curve; they 
converge for Re(s) > 3. In computed examples it was observed that 
these L-series admit analytic continuations and functional equations. 
Hasse conjectured that this should be true for every elliptic curve. 
Deuring proved it for the elliptic curves admitting more “symme- 
tries,” namely those with complex multiplication. 

For a certain time it had been observed by numerical calculations 
that for many specific elliptic curves the numbers a, coincide with the 
coefficients c, of the Fourier series of some modular form. Elliptic 
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curves with the above property have been called modular elliptic 
curves, or also Weil elliptic curves. 

In 1955, during the Tokyo—Nikko conference on number theory, 
Taniyama proposed problems, two of which concerned — if still 
somewhat imprecisely — the above question. If Hasse’s conjecture 
were true, would the L-series be associated to some automorphic 
function, or even to a modular form? These problems were dis- 
cussed with Shimura and Weil. By 1964, Shimura made known in 
his lectures a very specific conjecture (which however did not appear 
in print on that occasion). Weil contributed in an important way to 
the investigation of the modularity of elliptic curves. His paper (of 
1967) acknowledges previous communications by Shimura but does 
not contain a statement of the conjecture which he considered, even 
later, to be problematic. According to a well-documented study by 
Lang (1995), we shall adopt the name “Shimura—Taniyama conjec- 
ture” for this penetrating statement. 


(7) The Shimura—Taniyama Conjecture: Every elliptic curve 
is modular. 


This is a short way of expressing the following: 


If E is any elliptic curve defined over Q, if N is its conductor, then 
there is a new cusp eigenform f of level N, whose Fourier coefficients 
Cn, are integers and such that for every prime p not dividing N, cp = 
a, (where a, is defined by counting the number of points of E in F,). 


This conjecture says that the rule of determination of the integers 
Gp is given by some modular form. 

(8) Shimura proved the converse of the Shimura—Taniyama con- 
jecture. Let f € M2(N) be such that its Fourier coefficients are in 
Z. We explain how it is possible to associate an elliptic curve. Let 
zy € H. For each y € T'o(N) consider the integral 


(20) 
walt) = fo f(a) de; 


it is independent of the path. The set {w.,(7) | y € To(NV)} is 


independent of zo, so it depends only on f. Using the fact that 
the Fourier coefficients of f are integers, the above set is a lattice 
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in H, that is, it is the set of all linear combinations, with integral 
coefficients, of two numbers in H* (the periods of f). This lattice 
gives rise in the usual manner to an analytic torus, hence to an elliptic 
curve / having an equation with coefficients in Z (thus E is defined 
over Q). Let C2(N) = {f € M2(N) | f is a cusp eigenform whose 
Fourier coefficients are integers}. The above construction associates 
to each f € C2(N) an elliptic curve E defined over Q. Moreover, the 
conductor of F is the level N of f and for each prime p not dividing 
the discriminant of EF, the Fourier coefficient c, of f is equal to the 
number a, (associated to E and p as was already indicated). 

Analytical methods involving the L-series of elliptic curves, their 
Euler product, analytic continuation, and functional equation play 
a fundamental role. 


C. The Work of Ribet and Wiles. 


The work of Ribet involved an argument of descent concerning Ga- 
lois representations and modular forms. We need to explain how 
Serre attached Galois representations to any elliptic curve EF defined 
over Q, that is, having an equation with integral coefficients. The 
set of points with complex coordinates (to which is added the point 
at infinity) constitutes an Abelian additive group, well defined by 
the stipulation that the point at infinity should be the zero for the 
addition. The addition is defined by the following rule: if P,Q,R 
are points on the curve, then P+ @Q-+ R = 0 when P,Q, R lie on 
one line (clarifications are needed when P = Q or in some other 
special cases). If kK is a subfield of C, let E(k) be the set of pairs 
of elements of K which satisfy the equation of E; then E(K) is a 
subgroup of E(C). 

For each prime p, it is equally possible to define the additive group 
E(F,). In the Abelian group E(C) we consider the set E(C)[p] of all 
elements of order dividing p. These are the point 0 and the points 
P such that P+ P+---+ P (p times) is equal to 0. Then E(C)[p] 
is a subgroup of order p?, which is isomorphic to Z/p x Z/p. The 
coordinates of the points in E(C)[p] are in some Galois extension Kv 
of finite degree over Q. The elements of the Galois group of K|Q 
act linearly on E(/) and permute among themselves the elements 
of E(C)|p]; by isomorphism with Z/p x Z/p this gives rise to linear 
transformations of Z/p x Z/p. Thus, we obtain a representation, 
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associated to E, of the Galois group of K|Q. It is usual to consider 
the field Q of all algebraic numbers; it has infinite degree and con- 
tains K. The Galois group of K|Q is a quotient of the Galois group 
G of Q|Q. So we obtain a representation pz, from G by a group 
of 2 x 2 matrices with entries in Z/p = F, (attention is also paid 
to the natural Krull topology of G). Similar considerations lead to 
representations Pp)» by means of 2 x 2 matrices with entries in Z/p” 
(for all n > 1). All the representations ppp» (for n > 1) fit together 
to produce a representation pz,g, of G by 2 x 2 matrices with entries 
in the field Q, of p-adic numbers. 

It is also possible to attach to any eigenform f with Fourier coeffi- 
cients in Z, a representation py.q, of G by 2 x 2 matrices with entries 
in Q,. If E is the elliptic curve associated to f, as indicated before, 
then the representations py g, and pz,g, are isomorphic. 

We outline the proof of Ribet. Assume that FLT is false for the 
exponent q, let EF be the Frey curve associated to a hypothetical so- 
lution; F is a semistable elliptic curve. Assuming that the conjecture 
of Shimura-Taniyama is valid, there exists a new cusp eigenform f of 
weight 2 and level equal to the conductor N of E. Then ps9, = pza,- 

Ribet proved that if p is an odd prime dividing N, N, = N/p, 
then there is a new cusp eigenform f, of weight 2 and level N,, such 
that ps, 7, = Pppr,- The argument may be repeated, leading to a 
nonzero cusp form of weight 2 and level 2 — which is impossible. 

Wiles proved that the Shimura—Taniyama conjecture is valid for 
semistable elliptic curves, in particular for the Frey curve. The the- 
ory of deformation of representations, created by Mazur, plays a 
great role; so does the result (proved with Taylor, and also later 
by Lenstra) on the structure of the commutative algebra generated 
by the Hecke operators. Cohomological results were developed and 
used in essential ways. The proof is at a maximal level of sophistica- 
tion, so it is impossible to report in an intelligent way in this book. 
Among the expository papers listed in the Bibliography, we may 
recommend the one by Gouvéa which delineates the proof, avoiding 
technical details, making it accessible for the courageous amateur. 
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For my readers who are still courageous, I include a bibliography 
which lists not only research papers, but also expository material 
and should be explored by readers wishing to enter deeper into the 
proof of Wiles. 

For the convenience of the reader, the references are organized as 
follows: 


A. Elliptic Curves, Modular Forms: Basic Texts. 


1962 Gunning, R.C., Lectures on Modular Forms, Princeton Uni- 
versity Press, Princeton, NJ, 1962. 

1971 Shimura, G., Introduction to the Theory of Automorphic 
Functions, Princeton University Press, Princeton, NJ, 1971. 

1972 Ogg, A., Survey of modular functions of one variable, in: 
Modular Functions of One Variable (editor, W. Kuyk), Sprin- 
ger-Verlag, New York, 1972. 

1974 Tate, J., The arithmetic of elliptic curves, Invent. Math., 23 
(1974), 179-206. 

1976 Lang, 8., Introduction to Modular Forms, Springer-Verlag, 
New York, 1976. 

1984 Koblitz, N., Introduction to Elliptic Curves and Modular 
Forms, Springer-Verlag, New York, 1984. 

1986 Silverman, J.H., The Arithmetic of Elliptic Curves, Springer- 
Verlag, New York, 1986. 

1986 Cornell, G. and Silverman, J.H. (editors), Arithmetic Geom- 
etry, Springer-Verlag, Berlin, 1986. 

1989 Miyake, T., Modular Forms, Springer-Verlag, New York, 1989. 

1989 Hida, H., Theory of p-adic Hecke algebras and Galois repre- 
sentations, Sigaku Expositions, 2 (1989), 75-102. 

1989 Gouvéa, F.Q., Formas Modulares, uma Introducdéo, Instituto 
de Matematica Pura e Aplicada, Rio de Janeiro, 1989. 

1991 Cassels, J.W.S., Lectures on Elliptic Curves, Cambridge Uni- 
versity Press, Cambridge, 1991. 

1992 Tate, J. and Silverman, J.H., Rational Points on Elliptic 
Curves, Springer-Verlag, New York, 1992. 
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B. Expository. 
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Cipra, B.A., Fermat’s last corollary?, Focus, March—April 
1988, pp. 2 and 6. 

Shimura, G., Yataku Taniyama and his time. Very personal 
recollections, Bull. London Math. Soc., 21 (1989), 186-196. 
Ribet, K.A., From the Taniyama-Shimura conjecture to Fer- 
mat’s last theorem, Ann. Fac. Sci. Toulouse Math., (5), 11 
(1990), no. 1, 116-139. 

Murty, M. Ram, Fermat’s last theorem, an outline, Gaz. Soc. 
Math. Québec, 16 (1993), No. 1, 4-13. 

Murty, M. Ram, Topics in Number Theory, Mehta Res. Inst. 
Lect. Notes, No. 1, Allahabad, 1993. 

Frey, G., Uber A. Wiles’ Beweis der Fermatschen Vermutung, 
Math. Semesterber., 40 (1993), no. 2, 177-191. 

Ribet, K.A., Modular elliptic curves and Fermat’s last the- 
orem, Videocassette, 100 min., Amer. Math. Soc., Provi- 
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As aconcluding note, here are timely communications by Karl Rubin 
which circulated widely. 


E-mail message no. 1: 


Date: June 23, 1993, 05:52:30 

Subject: big news 

Andrew Wiles just announced, at the end of his 3rd lecture here, 
that he has proved Fermat’s Last Theorem. He did this by proving 
that every semistable elliptic curve over Q (i.e. square-free conduc- 
tor) is modular. The curves that Frey writes down, arising from 
countereramples to Fermat, are semistable and by work of Ribet they 
cannot be modular, so this does tt. 

It’s an amazing piece of work. 

Karl 
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E-mail message no. 2: 


Date: Oct. 25, 1994, 10:24:46 
Subject: update on Fermat’s last theorem 
As of this morning, two manuscripts have been released: 


Modular elliptic curves and Fermat’s last theorem, 
by Andrew Wiles 

Ring theoretic properties of certain Hecke algebras, by 
Richard Taylor and Andrew Wiles. 


The first one (long) announces a proof of, among other things, 
Fermat’s last theorem, relying on the second one (short) for one 
crucial step. 

As most of you know, the argument described by Wiles in his Cam- 
bridge lectures turned out to have a serious gap, namely the construc- 
tion of an Euler system. After trying unsuccessfully to repair that 
construction, Wiles went back to a different approach, which he had 
tried earlier but abandoned in favor of the Euler systems idea. He was 
able to complete his proof, under the hypothesis that certain Hecke 
algebras are local complete intersections. This and the rest of the 
ideas described in Wiles’ Cambridge lectures are written up in the 
first manuscript. Jointly, Taylor and Wiles establish the necessary 
property of the Hecke algebras in the second paper. 

The overall outline of the argument is similar to the one Wiles de- 
scribed in Cambridge. The new approach turns out to be significantly 
simpler and shorter than the original one, because of the removal of 
the Euler system. (In fact, after seeing these manuscripts, Faltings 
has apparently come up with a further significant simplification of 
that part of the argument.) 

Versions of these manuscripts have been in the hands of a small 
number of people for (in some cases) a few weeks. While it is wise 
to be cautious for a little while longer, there is certainly reason for 
optimism. 

Karl Rubin 


Excitement, caution, and amazement in the face of a superlative 
feat in Mathematics. 


Appendix A 
References to Wrong Proofs 


It is well known that there have been literally thousands of wrong 
proofs of Fermat’s last theorem. This can be explained by the fact 
that the statement of the problem is easily understandable to an 
amateur. Moreover, there have been important prizes offered by 
academies and foundations which have stimulated efforts by dilet- 
tantes as well as professional mathematicians. 

Since the Wolfskehl Prize was established in 1908, in the first 
years alone, 621 wrong solutions were submitted, and today there 
are about 3 meters of file correspondence and proposed solutions of 
Fermat’s problem stored in G6ttingen. 

We indicate below a list — obviously incomplete — of some noto- 
riously wrong published attempts to solve the problem. Even good 
professional mathematicians have not escaped from being included 
in the roll. 

Remarkable is the case of F. Lindemann, who discovered the tran- 
scendency of the number z. Yet, with respect to Fermat’s last the- 
orem, all his attempts failed. 

F. Paulet seems to have been one of the most persistent, with 
twelve submissions to the Academy of Sciences of Paris, spanning 
the years 1841 to 1862, but not bringing any progress to the investi- 
gation. 
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With only a few exceptions, we do not mention wrong solutions 
published by the authors as independent books or brochures; some 
of these have been listed by Fleck and Maennchen (1908-1912), Mi- 
rimanoff (1909), and Perron (1916). Instead, we concentrate only on 
the intended, but failed solutions, published in mathematical jour- 
nals or proceedings of conferences. 

First, we give a selection of books or papers containing references 
to wrong proofs. This is followed by a list of wrong papers, including 
an indication of where the mistake is discussed. 


I. Papers or Books Containing Lists of Wrong Proofs 


1908 Hoffmann, F., Der Satz vom Fermat. Sein seit dem Jahr 
1658 gesuchter Beweis, J. Singer, Strasbourg, 1908. 

1909 Lampe, E., Jahrbuch Fortschritte Math., 40 (1909), 258— 
261. 

1909/10/11/12/16 Fleck, A. and Maennchen, A., Vermeintliche 
eon des Fermatschen Satzes, Arch. Math. Phys., (3), 

Be 284-286, 370-372; 

5 (1909), 108-111; 

6 (1910), 105-109 and 372-375; 

Se 108-109 and 370-374; 
), 
), 


~~~ 


1912), 105-109 and 204-206; 
Shinie 267-268. 

1910 ea B., Uber das ae Fermatsche Theorem, Abh. Gesch- 
ichte Math. Wiss., 26 (1910), 23-65. 

1920 Dickson, L.E., ae of the Theory of Numbers, Vol. II, 
Carnegie Institution, Washington, DC, 1920; reprinted by 
Chelsea, New York, 1971. 

1973 Besenfelder, H.J.,! Das Fermat-Problem, Diplomarbeit, Uni- 
versitat Karlsruhe, 1973, 61 pp. 


II. Wrong Proofs in Papers 


1810 Barlow, P., Demonstration of a curious numerical proposi- 
tion, J. Nat. Phil. Chem. Arts, 27 (1810), 193-205. 


‘His family name has changed from “Besenfelder” to “Bentz” since Au- 
gust 1979. 
?This paper uses an incorrect result of Kapferer (1933). 
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TABLE 8. Wrong proofs. 

Year Author Mistake Pointed Out by Year 
1810, Barlow Smith 1860 
1811 Talbot 1864 
1845 Drach Dickson, p. 738 1820 
1847 Lamé Liouville 1847 
Kummer 1847 
Dickson, pp. 739/40 1920 
1855 Calzolari Lind, p. 48 1910 
/57/64 Dickson, pp. 743, 744, 746 1920 
1864 Gaudin Dickson, p. 746 1920 
1890 Lefébure Pepin 1880 
1889 Varisco Landsberg 1890 
Dickson, p. 754 1920 
1893 Korneck Picard and Poincaré 1894 
Dickson, p. 756 1920 
1901/ Lindemann Fleck and Maennchen 1909 
1907/ Furtwangler, Fleck 1909 
1909 Ivanov 1910 
Dickson, pp. 759, 762 1920 
1908 Werebrusow Dickson, Worms de Romilly, 1908 

Duran-Loriga, Curjel 
Dickson, p. 762 1920 
1910 Lind Fleck 1910 
Dickson, p. 760 1920 
1913 Fabry Mirimanoff 1913 
1955 Becker, W. W. | Eggan 1981 
1956 Fraga Torrejon | Rodeja, F. 1956 
1957 Villasenor, Z. | Math. Rev., 19 (1958), No. 251f | 1958 
1957 Noguera Math. Rev., 19 (1958), No. 16e | 1958 
Barreneche | Math. Rev., 20 (1959), No. 1658 | 1959 
1958/9 | Draeger Morishima 1960 
1958/73 | Yahya? Gandhi and Stuff 1975 
1973/77 Inkeri 1984 
1966 Sarantopoulos | Garrison 1967 
1978 Zinoviev Kreisel 1978 
1979 Clarke and Oral communication by 1983 

Shannon J. H. Ursell 

1980 Maggu Eggan 1980 
1980 Lallu-Singh Yamaguchi 1982 
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1811 


1845 


1847 


1847 


1847 


1847 


1847 


1847 


1847 


1855 


1857 


1860 


1864 


1864 


1864 


Barlow, P., An Elementary Investigation of Theory of Num- 
bers, pp. 160-169, J. Johnson, St. Paul’s Church-Yard, Lon- 
don, 1811. 

Drach, S.M., Proof of Fermat’s undemonstrated theorem that 
xz” + y”" = z” is only possible in whole numbers when n = 1 
or 2, Phil. Mag., 27 (1845), 286-289. 

Lamé, G., Mémoire sur la résolution en nombres complexes 
de l’équation A” + BX + C" = 0, J. Math. Pures Appl., 12 
(1847), 172-184. 

Lamé, G., Démonstration générale du théoréme de Fermat 
sur Vimpossibilité en nombres entiers de l’équation x" +y”" = 
2", C. R. Acad. Sci. Paris, 24 (1847), 310-314. 

Lamé, G., Note au sujet de la démonstration du théoréme de 
Fermat, C. R. Acad. Sci. Paris, 24 (1847), 352. 

Lamé, G., Second mémoire sur le dernier théoréme de Fer- 
mat, C. R. Acad. Sci. Paris, 24 (1847), 569-572. 

Lamé, G., Troisiéme mémoire sur le dernier théoréme de 
Fermat, C. R. Acad. Sci. Paris, 24 (1847), 888. 

Kummer, E. E., Extrait d’une lettre de M. Kummer a M. 
Liouville, J. Math. Pures Appl., 12 (1847), 136; reprinted 
in Collected Papers, Vol. I, p. 298. Springer-Verlag, Berlin, 
1975. 

Liouville, J., Remarques a l’occasion d’une communication 
de M. Lamé sur un théoréme de Fermat, C. R. Acad. Sci. 
Paris, 24 (1847), 315-316. 

Calzolari, L., Tentativo per dimostrare il teorema di Fermat 
sull’equazione indeterminata x” + y” = z", Ferrara, 1855. 
Calzolari, L., Dimostrazione dell’ultimo teorema di Fermat, 
Annali Sci. Mat. B. Tortolini, 8 (1857), 339-349. 

Smith, H.J.S., Report on the Theory of Numbers, Part IT, 
Art. 61, “Application to the Last Theorem of Fermat”, Col- 
lected Math. Papers, Vol. I, 1894, pp. 1381-137 Clarendon 
Press, Oxford, 1894; reprinted by Chelsea, New York, 1965. 
Calzolari, L., Impossibilita in numeri interi dell’equazione 
z?=a"+y" quando n > 2, Ann. Mat., 6 (1864), 280-286. 
Gaudin, A., Impossibilité de l’équation (a + h)" — a” = 2", 
C. R. Acad. Sci. Paris, 59 (1864), 1036-1038. 

Talbot, W.H.F., On the theory of numbers, Trans. Roy. Soc. 
Edinburgh, 23 (1864), 45-52. 


1880 


1880 


1889 


1890 


1893 


1893 


1894 


1901 


1907 


1908 


1908 


1908 


1908 


1908 


1909 


1909 
1909 
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Lefébure, A., Sur la résolution de l’équation x" + y” = z” en 
nombres entiers, C. R. Acad. Sci. Paris, 90 (1880), 1406— 
1407. 

Pepin, T., Sur diverses tentatives de démonstration du théo- 
reme de Fermat, C. R. Acad. Sci. Paris, 91 (1880), 366-367. 
Varisco, D., Ricerche aritmetiche contenente la dimostrazione 
generale del teorema di Fermat, Giorn. Mat., 27 (1889), 371— 
380. 

Landsberg, O., Lettera al redattore, Giorn. Mat., 28 (1890), 
52. 

Korneck, G., Beweis des Fermatschen Satzes von der Unmog- 
lichkeit der Gleichung x" + y” = z” ftir rationale Zahlen und 
n > 2, Arch. Math. Phys., (2), 13 (1893), 1-9. 

Korneck, G., Nachtrag zum Beweis des Fermatschen Satzes, 
Arch. Math. Phys., (2), 13 (1893), 263-267. 

Picard, E. and Poincaré, H., Rapport verbal sur les articles 
de M. G. Korneck, C. R. Acad. Sci. Paris, 118 (1894), 841. 
Lindemann, F., Uber den Fermatschen Satz betreffend die 
Unmdglichkeit der Gleichung x” = y" + z”, Sitzungsber. 
Akad. Wiss. Miinchen, Math., 31 (1901), 185-202; corri- 
genda, p. 495. | 

Lindemann, F., Uber das sogenannte letzte Fermatsche Theo- 
rem, Sitzungsber. Akad. Wiss. Miinchen, Math., 37 (1907), 
287-352. 


Dickson, L. E., Dernier théoréme de Fermat, L’Interm. Math. 


15 (1908), 174. 

Curjel, H. W., Dernier théoréme de Fermat (Question 612 
de Worms de Romilly), L’Interm. Math., 15 (1908), 247. 
Duran-Loriga, J. J., Sur le dernier théoréme de Fermat (Ré- 
ponse de M. Werebrusow), L’Interm. Math., 15 (1908), 177. 
Werebrusow, A. S., Impossibilité de l’équation x” = y” + 2” 
(Question 612 de Worms de Romilly), L’Interm. Math., 15 
(1908), 79-81. 

Worms de Romilly, A. S., Le dernier théoréme de Fermat, 
L’Interm. Math., 15 (1908), 175-177. 

Lindemann, F., Uber den sogenannten letzten Fermatschen 
Satz, Veit, Leipzig, 1909, 83 pp. 

Fleck, A. and Maennchen, A.: See in List I. 

Furtwaingler, P., Review of Lindemann’s “Uber den sogenan- 


’ 
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nten letzten Fermatschen Satz,” Jahrbuch Fortschritte Math., 
40 (1909), 258. 

1910 Ivanov, LI., Uber den von Prof. F. Lindemann vorgeschlage- 
nen Beweis des Fermatschen Satzes (Brief an die Redaktion), 
Jahrbuch Fortschritte Math., 41 (1910), 238. 

1910 Lind, B.: See in List I. 

1913 Fabry, E., Un essai de démonstration du théoréme de Fermat, 
C. R. Acad. Sci. Paris, 156 (1913), 1814-1816. 

1913 Mirimanoff, D., Remarque sur une communication de M. 
Eugéne Fabry, C. R. Acad. Sci. Paris, 157 (1913), 491-492. 

1920 Dickson, L.E., See in List I. 

1933 Kapferer, H., Uber die diophantischen Gleichungen 23 — y? = 
3° -2a**? und deren Abhdngigkeit von der Fermatschen Ver- 
mutung, Heidelberger Akad., Math. Naturwiss. Klasse, Abh., 
2 (1933), 32-37. 

1956 Fraga Torrején, E. de, Note on Fermat’s last theorem, Las 
Ciéncias, 21 (1956), 5-13. 

1956 Rodeja, F., E.G., On Fermat’s last theorem, Las Ciéncias, 21 
(1956), 382-383. 

1957 Noguera Barreneche, R., Solucién general de la ecuacion alge- 
braico-exponencial X” + Y” = Z”, Studia Rev. Univ. At- 
lantico, 2 (1957), 119-126. 

1957 Noguera Barreneche, R., Historically the first proof incon- 
trovertible, complete and universal of the grand theorem of 
Fermat, with the Davidic algebra of the “principle of the am- 
ateurs” in mathematical investigation (in Spanish), Studia 
Rev. Univ. Atlantico, 2 (1957), 199-209. 

1957 Villasenor Z., F., El celebre teorema de Fermat y su demon- 
stracién, Mexico, 1957, 127 pp. 

1958 Yahya, Q.A.M.M., Complete proof of Fermat’s last theorem, 
Author’s publication, Pakistan Air Force, Kohat, Pakistan, 
1958, 14 pp. 

1958/9 Draeger, M., Das Fermat-Problem, Wiss. Z. Techn. Hoch- 
sch. Dresden, 8 (1958/9), 941-946. 

1960 Morishima, T., Review of the paper by Draeger “Das Fermat- 
Problem,” Math. Rev., 23 (1960), A2375. 

1966 Sarantopoulos, S., Du premier cas du théoréme de Fermat. 
Bull. Soc. Math. Gréce (N.S.), 10 (1966), 76-115. 

1971 Garrison, B., Review of the above paper by Sarantopoulos 
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(with a remark by E.G. Straus), Math. Rev., 42 (1971), No. 
4483. 

1973 Yahya, Q.A.M.M., On general proof of Fermat’s last theorem, 
Portugal. Math., 32 (1973), 157-170. 

1975 Gandhi, J.M. and Stuff, M., Comments on certain results 
about Fermat’s last theorem, Notices Amer. Math. Soc., 22 
(1975), A-502. 

1978 Kreisel, G., Letter to Ribenboim (6 June 1978). Atlantis 
Hotel, Ziirich. 

1979 Clarke, J.H. and Shannon, A.G., Some observations on Fer- 
mat’s last theorem, New Zealand Math. Mag., 16 (1979), 
80-83. 

1979 Zinoviev, A. A., Complete (rigorous) induction and Fermat’s 
great theorem (with a report by G. Kreisel), Logique et Anal., 
22 (1979), no. 87, 243-263. 

1980 Maggu, P.L., On the proof of Fermat’s last theorem, Pure 
Appl. Math. Sci., 12 (1980), 1-9. 

1981 Eggan, L.C., Review of the above paper by Maggu, Math. 
Rev., 81g (1981), No. 10032. 

1984 Inkeri, K., On certain equivalent statements for Fermat’s last 
theorem — with requisite corrections, Ann. Univ. Turku., 
Ser. AI, 186 (1984) 12-22; reprinted in Collected Papers 
of Kustaa Inkeri (editor, P. Ribenboim), Queen’s Papers in 
Pure and Applied Mathematics, Vol. 91, Kingston, Ontario, 
1992. 


III. Insufficient Attempts 


We add to the above list some publications involving methods which 
are clearly insufficient to solve the problem. 


1951 Natucci, A., Osservazioni sul problema di Fermat, Bull. Un. 
Mat. Ital., (3), 6 (1951), 245-248. 

1953 Natucci, A., Ricerche sistematiche sull’ultimo teorema di Fer- 
mat, Giorn. Mat., (5), 1(81) (1953), 171-179. 

1975 Peiulescu, V., Teorema lui Fermat, Ed. Litera, Bucuresti, 
1975, 86 pp. 

1976 Yahya, Q.A.M.M., On general proof of Fermat’s last theorem— 
epilogue, Portugal. Math., 35 (1976), 9-15. 
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1977 Yahya, Q.A.M.M., Fermat’s last theorem—a topological ver- 
ification, Portugal. Math., 36 (1977), 25-31. 

1979 De Fermate, J.F. (pseudonym of Guillotte, G.), A Famous 
Problem in Number Theory of the First Kind Dating from 
1687: x? +y? = z?,p > 2, Cowansville Printing, Cowansville, 
Quebec; Vol. I, 1979, 22 pp.; Vol. II, 1980, 8 pp. 

1980 Singh, L., The general proof of Fermat’s last theorem, J. In- 
dian Acad. Math., 2 (1980), 43-50. 

There have also been quite a number of mistakes in papers re- 
lated to Fermat’s last theorem, aiming to establish partial results, 
necessary conditions, etc. We have referred to these errors at the 
appropriate place in the text. 


Appendix B 
General Bibliography 


I. The Works of Fermat 


After Fermat’s death, his collected papers were published under the 
supervision of his son Samuel de Fermat. 


1679 Varia Opera Mathematica, D. Petri de Fermat, Sen- 
atoris Tolosani. Tolosae, Apud Joannem Pec, Comi- 
tiorum Fuzensium Typographum justa Collegium P 
P. Societatis JESU. 


These books were reprinted in four volumes and a supplement: 


1891, 1894, Oeuvres de Pierre de Fermat, 
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